Substrates for O6-Alkylguanine-DNA Alkyltransferase

Field of the invention

The present invention relates to methods of transferring a label from novel substrates to
O6-alkylguanine-DNA alkyltransferases (AGT) and O6-alkylguanine-DNA alkyltransferase fusion
proteins, and to novel substrates suitable in such methods.



Background of the invention 






The mutagenic and carcinogenic effects of electrophiles such as N-methyl-N-nitrosourea are
mainly due to the O6-alkylation of guanine in DNA. To protect themselves against
DNA-alkylation, mammals and bacteria possess a protein, O6-alkylguanine-DNA alkyltransferase
(AGT), which repairs these lesions. AGT transfers the alkyl group from the position O-6 of
alkylated guanine and guanine derivatives to the mercapto group of one of its own cysteines,
resulting in an irreversibly alkylated AGT. The underlying mechanism is a nucleophilic
reaction of the SN2 type, which explains why not only methyl groups but also benzylic groups
are easily transferred. As overexpression of AGT in tumour cells is the main reason for
resistance to alkylating drugs such as procarbazine, dacarbazine, temozolomide and
bis-2-chloroethyl-N-nitrosourea, inhibitors of AGT have been proposed for use as sensitisers in
chemotherapy (Pegg et al., Prog Nucleic Acid Res Mol Biol 51: 167-223, 1995). US
5,691,307 describes O6-benzylguanines carrying various substituents in the benzyl group,
and their use for depleting AGT levels in tumor cells and thereby increasing responsiveness
to alkylating anti-tumor drugs. Likewise, WO 97/20843 discloses further AGT depleting
compounds representing O6-benzyl- and O6-heteroarylmethyl-pyrimidine derivatives.

DE 19? 03 895 discloses an assay for measuring levels of AGT which relies on the reaction
between biotinylated O6-alkylguanine derivatives and AGT which leads to biotinylation of the
AGT. This in turn allows the separation of the AGT on a streptavidin coated plate and its
detection, e.g. in an ELISA assay. The assay is suggested for monitoring the level of AGT in
tumour tissue and for use in screening for AGT inhibitors.

WO 01/85221 proposes the use of radiolabeled fluoro- or iodo-substituted O6-benzylguanines
for detection of AGT and monitoring the level of AGT.






Damoiseaux et al., ChemBioChem 4: 285-287, 2001, disclose modified O6-alkylated guanine
derivatives incorporated into oligodeoxyribonucleotides for use as chemical probes for
labelling AGT, again to facilitate detecting the levels of this enzyme in cancer cells to aid in
research and in chemotherapy.

PCT/GB02/01636 (WO 02/083937) discloses a method for detecting and/or manipulating a
protein of interest wherein the protein is fused to AGT and the AGT fusion protein contacted
with an AGT substrate carrying a label, and the AGT fusion protein detected and optionally
further manipulated using the label. Several AGT fusion proteins to be used, general
structural principles of the AGT substrate and a broad variety of labels and methods to detect
the label useful in the method are described.


Summary of the invention 

The invention relates to a method for detecting and/or manipulating a protein of interest,
wherein the protein of interest is incorporated into an AGT fusion protein, the AGT fusion
protein is contacted with particular AGT substrates carrying a label, and the AGT fusion
protein is detected and optionally further manipulated using the label in a system designed
for recognising and/or handling the label.

The particular AGT substrates used in the method of the invention are O6-substituted
guanine derivatives or related nitrogen containing hydroxy-heterocycles and their sulfur
analogs wherein the O6-substituent is an activated methyl derivative suitable for transfer from
guanine or the corresponding heterocycle to AGT, and further carrying a label. The label
may consist of a plurality of same or different labels. Activated methyl derivatives are e.g.
arylmethyl derivatives suitably substituted in the aryl ring, heteroarylmethyl derivatives
suitably substituted in the heteroaryl ring, and allyl type derivatives suitably substituted at the
double bond. Suitable substituents of the aryl ring, heteroaryl ring or allylic double bond are
linkers connecting a label to the aryl ring, heteroaryl ring or allyl group, preferably linkers
which may undergo further modification or cleavage, and also linkers which give rise to
dimeric or cyclised AGT substrates. The invention relates also to the novel AGT substrates
as such, to methods of manufacture of such novel substrates, and to intermediates useful in
the synthesis of such novel AGT substrates.

Brief description of the Figures

Figure 1A: Western Blot of an anti-biotin conjugate coupled to HRP (Pierce).

Lanes 1 and 2: PGEA hAGT-GST (an AGT fusion protein) after reaction with compound 11 (a
compound of formula 1 wherein R3 is triazolylmethyl) and then with a biotinylated
O6-benzylguanine oligonucleotide known to be an AGT substrate. Lane marked "C" (control): The
reaction product of PGEA hAGT-GST and the biotinylated O6-benzylguanine oligonucleotide.

Figure 1B: Western Blot of a goat anti-fluorescein antibody (Rockland) and a HRP
conjugated anti-goat antibody (Sigma).

Lanes 1 and 2 (two different concentrations) show the reaction product of compound 60 
bearing a fluorescein label and a biotin label (a compound of formula 1 wherein L is a 
plurality of labels) with PGEA hAGT-GST. The lane on the left side shows molecular weight 
markers of 47 kDa and 79 kDa. 


Figure 2: Sensorgram of a surface plasmon resonance of the reaction product of PGEA hAGT-GST
(an AGT fusion protein) with compound 54 (compound of formula 1 wherein L is biotin)
applied to a Streptavidin Biacore chip. The chip is further treated with a solution of an αGST
antibody. A: Addition of PGEA hAGT-GST preincubated with 54. B: Addition of buffer solution
(HBST). C: Addition of αGST antibody.

Figure 3A: Western Blot with an anti-hAGT antibody and an anti-mouse-HRP conjugate as a
second antibody. At the bottom of the lane, the ratio of compound 43 (a compound of
formula 1 wherein L is a further -R3-CH2-X-R1-R2 residue, i.e. a dimerizing compound) to
PGEG hAGT-GST is shown.

Figure 3B: Western Blot with a goat anti-hAGT and an anti-goat IgG of the in vivo reaction
product of dimerizing compound 61 with fusion protein HA-W160 hAGT in E. coli. At the bottom
of the lane, different concentrations of 61 (a compound of formula 1 wherein L is a further
-R3-CH2-X-R1-R2 residue) are indicated.

Figure 4: Schematic representation of hAGT-based three-hybrid system. On reaction of the
hAGT-LexA fusion protein with the compound of formula 1 wherein L is methotrexate (M) in
the presence of a fusion protein DHFR-B42, hAGT-LexA and DHFR-B42 are coupled and
transcription started.

Detailed description of the invention 

In the present invention a protein or peptide of interest is fused to an O6-alkylguanine-DNA
alkyltransferase (AGT). The protein or peptide of interest may be of any length, with or
without secondary, tertiary or quaternary structure, and preferably consists of at least
twelve amino acids and up to 2000 amino acids. Examples of such protein or peptide of
interest are provided below, and are e.g. enzymes, DNA-binding proteins, transcription
regulating proteins, membrane proteins, nuclear receptor proteins, nuclear localization signal
proteins, protein cofactors, small monomeric GTPases, ATP-binding cassette proteins,
intracellular structural proteins, proteins with sequences responsible for targeting proteins to
particular cellular compartments, proteins generally used as labels or affinity tags, and
domains or subdomains of the aforementioned proteins. The protein or peptide of interest is
preferably fused to AGT by way of a linker which may be cleaved by an enzyme, e.g. at the
DNA stage by suitable restriction enzymes, e.g. AGATCT cleavable by Bgl II, and/or linkers
cleavable by suitable enzymes at the protein stage, e.g. tobacco etch virus NIa (TEV)
protease. Fusion proteins may be expressed in prokaryotic hosts, preferably E. coli, or
eukaryotic hosts, e.g. yeast, insect or mammalian cells.
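
The DNA-stage linker mentioned above can be illustrated with a minimal Python sketch that locates Bgl II recognition sites (AGATCT) in a sequence and splits the top strand at them. The function names and the example sequence are hypothetical, and the cut position (after the first A of the site, i.e. A^GATCT) is stated as general knowledge about Bgl II, not taken from this specification.

```python
SITE = "AGATCT"   # Bgl II recognition sequence named in the text
CUT_OFFSET = 1    # assumption: Bgl II cuts after the first A (A^GATCT)


def find_bglii_sites(seq: str) -> list[int]:
    """Return the 0-based start index of every AGATCT site in seq."""
    seq = seq.upper()
    positions = []
    start = seq.find(SITE)
    while start != -1:
        positions.append(start)
        start = seq.find(SITE, start + 1)
    return positions


def cut_bglii(seq: str) -> list[str]:
    """Split the top strand of seq at every Bgl II site."""
    seq = seq.upper()
    fragments = []
    previous = 0
    for pos in find_bglii_sites(seq):
        cut = pos + CUT_OFFSET
        fragments.append(seq[previous:cut])
        previous = cut
    fragments.append(seq[previous:])
    return fragments


# Hypothetical linker region between a gene of interest and AGT
example = "ATGGCCAGATCTGGTACC"
sites = find_bglii_sites(example)
fragments = cut_bglii(example)
```

Such a check only confirms where a site sits in a construct; whether a given site is unique in the full plasmid would have to be verified separately.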



The O6-alkylguanine-DNA alkyltransferase (AGT) has the property of transferring a label
present on a substrate to one of the cysteine residues of the AGT forming part of a fusion
protein. In preferred embodiments, the AGT is a known human O6-alkylguanine-DNA
alkyltransferase, hAGT. Murine or rat forms of the enzyme are also considered provided
they have similar properties in reacting with a substrate like human AGT. In the present
invention, O6-alkylguanine-DNA alkyltransferase also includes variants of a wild-type AGT
which may differ by virtue of one or more amino acid substitutions, deletions or additions, but
which still retain the property of transferring a label present on a substrate to the AGT part of
the fusion protein. AGT variants may be obtained by chemical modification using techniques
well known to those skilled in the art. AGT variants may preferably be produced using
protein engineering techniques known to the skilled person and/or using molecular evolution
to generate and select new O6-alkylguanine-DNA alkyltransferases. Such techniques are
e.g. saturation mutagenesis, error prone PCR to introduce variations anywhere in the
sequence, DNA shuffling used after saturation mutagenesis and/or error prone PCR, or
family shuffling using genes from several species.

The fusion protein comprising the protein of interest and an O6-alkylguanine-DNA
alkyltransferase (AGT) is contacted with a particular substrate having a label. Conditions of
reaction are selected such that the AGT reacts with the substrate and transfers the label of
the substrate. Usual conditions are a buffer solution at around pH 7 at room temperature,
e.g. around 25°C. However, it is understood that AGT reacts also under a variety of other
conditions, and the conditions mentioned here do not limit the scope of the invention.


AGT irreversibly transfers the alkyl group from its substrate, O6-alkylguanine-DNA, to one of
its cysteine residues. A substrate analogue that rapidly reacts with hAGT is O6-benzylguanine,
the second order rate constant being approximately 10^3 sec^-1 M^-1. Substitutions of
O6-benzylguanine at the C-4 of the benzyl ring do not significantly affect the reactivity of
hAGT against O6-benzylguanine derivatives, and this property has been used to transfer a
label attached to the C-4 of the benzyl ring to AGT.
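
To give a feel for what a second order rate constant of about 10^3 sec^-1 M^-1 means in practice, the following Python sketch estimates the labelling time course. It assumes the substrate is in large excess over AGT, so the reaction is treated as pseudo-first-order. The function names and the 5 µM substrate concentration are hypothetical, illustrative choices and do not come from this specification.

```python
import math

K2 = 1.0e3  # M^-1 s^-1, approximate rate constant quoted for O6-benzylguanine


def labelled_fraction(substrate_molar: float, seconds: float, k2: float = K2) -> float:
    """Fraction of AGT labelled after `seconds`, assuming substrate in large excess."""
    k_obs = k2 * substrate_molar  # pseudo-first-order rate constant in s^-1
    return 1.0 - math.exp(-k_obs * seconds)


def half_time(substrate_molar: float, k2: float = K2) -> float:
    """Time in seconds at which half of the AGT is labelled."""
    return math.log(2) / (k2 * substrate_molar)


# Hypothetical condition: 5 micromolar substrate
t_half = half_time(5e-6)                    # roughly 139 s
after_10_min = labelled_fraction(5e-6, 600) # roughly 0.95
```

Under these assumptions, labelling at micromolar substrate concentration proceeds on a time scale of minutes; other substrates or AGT variants may have different rate constants.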

The label part of the substrate can be chosen by those skilled in the art dependent on the 
application for which the fusion protein is intended. After contacting the fusion protein 
comprising AGT with the substrate, the label is covalently bonded to the fusion protein. The 
labelled AGT fusion protein is then further manipulated and/or detected by virtue of the 
transferred label. The label may consist of a plurality of same or different labels. If the 
substrate contains more than one label, the corresponding labelled AGT fusion protein will 
also comprise more than one label which gives more options for further manipulating and/or 
detecting the labelled fusion protein. 

The particular AGT substrates are compounds of the formula 1

R2-R1-X-CH2-R3-R4-L    (1)

wherein R1-R2 is a group recognized by AGT as a substrate;
X is oxygen or sulfur;
R3 is an aromatic or a heteroaromatic group, or an optionally substituted unsaturated alkyl,
cycloalkyl or heterocyclyl group with the double bond connected to CH2;
R4 is a linker; and
L is a label, a plurality of same or different labels, a bond connecting R4 to R1 forming a cyclic
substrate, or a further group -R3-CH2-X-R1-R2.

In a group R1-R2, the residue R1 is preferably a heteroaromatic group containing 1 to 5
nitrogen atoms, recognized by AGT as a substrate.

A heteroaromatic group R1 is mono- or bicyclic and has 5 to 12, preferably 6 or 9 or 10 ring
atoms; which in addition to carrying a substituent R2 may be unsubstituted or substituted by
one or more, especially one, two or three further substituents selected from the group
consisting of lower alkyl, such as methyl, lower alkoxy, such as methoxy or ethoxy, hydroxy,
oxo, amino, lower alkylamino, di-lower alkylamino, acylamino, halogen, such as chlorine or
bromine, halogenated lower alkyl, such as trifluoromethyl, carboxy, lower alkoxycarbonyl,
carbamoyl, lower alkylcarbamoyl, or lower alkylcarbonyl.

Lower alkyl is preferably alkyl with 1 to 7, preferably from 1 to 4 C atoms, and is linear or
branched; preferably, lower alkyl is butyl, such as n-butyl, sec-butyl, isobutyl, tert-butyl,
propyl, such as n-propyl or isopropyl, ethyl or methyl. Preferably lower alkyl is methyl.

In lower alkoxy, the lower alkyl group is as defined hereinbefore. Lower alkoxy denotes
preferably n-butoxy, tert-butoxy, iso-propoxy, ethoxy, or methoxy, in particular methoxy.

Preferably the mono- or bicyclic heteroaromatic group R1 is selected from 2H-pyrrolyl,
pyrrolyl, imidazolyl, benzimidazolyl, pyrazolyl, indazolyl, purinyl, 8-azapurinyl, pyridyl,
pyrazinyl, pyrimidinyl, pyridazinyl, 4H-quinolizinyl, isoquinolyl, quinolyl, phthalazinyl,
naphthyridinyl, quinoxalyl, quinazolinyl, quinolinyl, pteridinyl, indolizinyl, 3H-indolyl, indolyl,
isoindolyl, triazolyl, tetrazolyl, or benzo[d]pyrazolyl. More preferably the mono- or bicyclic
heteroaromatic group R1 is selected from the group consisting of purinyl, 8-azapurinyl, pyridyl,
pyrazinyl, pyrimidinyl, and pyridazinyl.

For example the group R1-R2 may be a purine radical of formula 2



wherein R2 is hydrogen, alkyl of 1 to 10 carbon atoms, or a saccharide moiety;
R5 is hydrogen, halogen, e.g. chloro or bromo, trifluoromethyl, or hydroxy; and
R6 is hydrogen, hydroxy or unsubstituted or substituted amino.

If R5 or R6 is hydroxy, the purine radical is predominantly present in its tautomeric form
wherein a nitrogen adjacent to the carbon atom bearing R5 or R6 carries a hydrogen atom,
the double bond between this nitrogen atom and the carbon atom bearing R5 or R6 is a single
bond, and R5 or R6 is double bonded oxygen, respectively.

A substituted amino group R6 is lower alkylamino of 1 to 4 carbon atoms or acylamino,
wherein the acyl group is lower alkylcarbonyl with 1 to 5 carbon atoms, e.g. acetyl, propionyl,
n- or isopropylcarbonyl, or n-, iso- or tert-butylcarbonyl, or arylcarbonyl, e.g. benzoyl.

If R6 is unsubstituted or substituted amino and the residue X connected to the bond of the
purine radical is oxygen, the residue of formula 2 is a guanine derivative.

R2 as alkyl of 1 to 10 carbon atoms is linear or branched and includes lower alkyl of 1 to 4
carbon atoms, e.g. methyl, ethyl, butyl, such as n-butyl, sec-butyl, isobutyl or tert-butyl, and
propyl, such as n-propyl or isopropyl. R2 as alkyl may also be pentyl, hexyl, heptyl, octyl,
nonyl, or decyl, e.g. n-hexyl.

A saccharide moiety R2 is a saccharide monomer or oligomer connected with a spacer of
variable length to the N9 position of the guanine base. The spacer in this context is an alkyl
chain preferably from 1 to 15 carbon atoms, a polyethylene glycol spacer consisting of 1 to
200 ethylene glycol units, an amide group -CO-NH-, an ester group -CO-O-, an alkylene
group -CH=CH- or a combination of alkyl chain, polyethylene glycol group, amide group,
ester group, and/or alkylene group.

In the context of this invention, a saccharide moiety R2 further includes a β-D-2'-deoxyribosyl,
or a β-D-2'-deoxyribosyl being incorporated into a single stranded oligodeoxyribonucleotide
having a length of 2 to 99 nucleotides, wherein the guanine derivative R1 occupies any
position within the oligonucleotide sequence.

Particularly preferred are compounds wherein the group R1-R2 is a purine radical of formula
2, R2 is hydrogen, R5 is hydrogen, R6 is unsubstituted amino, and X is oxygen, i.e.
unsubstituted guanine derivatives.

In another preferred embodiment of the invention the group R1-R2 is an 8-azapurine radical of
formula 3

wherein the substituents R2 and R6 have the meaning as defined for R2 and R6 under
formula 2.

In a further preferred embodiment of the invention the group R1-R2 is a pyrimidine radical of
formula 4




[Structural formula 4]



wherein the substituent R2 has the meaning as defined under formula 2, and is preferably
hydrogen; and
R7 and R8 are both independently of one another hydrogen, halogen, e.g. chloro or bromo,
lower alkyl with 1 to 4 carbon atoms, e.g. methyl, amino, or nitro.

X is preferably oxygen. 

R3 as an aromatic or a heteroaromatic group, or an optionally substituted unsaturated alkyl,
cycloalkyl or heterocyclyl group is a group sterically and electronically accepted by AGT (in
accordance with its reaction mechanism) which allows the covalent transfer of the R3-R4-L
unit to the fusion protein. In a R3-R4-L unit, R4-L may also have the meaning of a plurality of
same or different linkers R4 carrying a plurality of same or different labels L.


R3 as an aromatic group is preferably phenyl or naphthyl, in particular phenyl, e.g. phenyl
substituted by R4 in para or meta position.

A heteroaromatic group R3 is a mono- or bicyclic heteroaryl group comprising zero, one, two,
three or four ring nitrogen atoms and zero or one oxygen atom and zero or one sulfur atom,
with the proviso that at least one ring carbon atom is replaced by a nitrogen, oxygen or sulfur
atom, and which has 5 to 12, preferably 5 or 6 ring atoms; and which in addition to carrying a
substituent R4 may be unsubstituted or substituted by one or more, especially one, further
substituents selected from the group consisting of lower alkyl, such as methyl, lower alkoxy,
such as methoxy or ethoxy, halogen, e.g. chlorine, bromine or fluorine, halogenated lower
alkyl, such as trifluoromethyl, or hydroxy.

Preferably the mono- or bicyclic heteroaryl group R3 is selected from 2H-pyrrolyl, pyrrolyl,
imidazolyl, benzimidazolyl, pyrazolyl, indazolyl, purinyl, pyridyl, pyrazinyl, pyrimidinyl,
pyridazinyl, 4H-quinolizinyl, isoquinolyl, quinolyl, phthalazinyl, naphthyridinyl, quinoxalyl,
quinazolinyl, quinolinyl, pteridinyl, indolizinyl, 3H-indolyl, indolyl, isoindolyl, oxazolyl,
isoxazolyl, thiazolyl, isothiazolyl, triazolyl, tetrazolyl, furazanyl, benzo[d]pyrazolyl, thienyl, and
furanyl. More preferably the mono- or bicyclic heteroaryl group is selected from the group
consisting of pyrrolyl, imidazolyl, such as 1H-imidazol-1-yl, benzimidazolyl, such as
1-benzimidazolyl, indazolyl, especially 5-indazolyl, pyridyl, e.g. 2-, 3- or 4-pyridyl, pyrimidinyl,
especially 2-pyrimidinyl, pyrazinyl, isoquinolinyl, especially 3-isoquinolinyl, quinolinyl,
especially 4- or 8-quinolinyl, indolyl, especially 3-indolyl, thiazolyl, triazolyl, tetrazolyl,
benzo[d]pyrazolyl, thienyl, and furanyl.

In a particularly preferred embodiment of the invention the heteroaryl group R3 is triazolyl,
especially 1-triazolyl, carrying the further substituent R4 in the 4- or 5-position, tetrazolyl,
especially 1-tetrazolyl, carrying the further substituent R4 in the 4- or 5-position, or
2-tetrazolyl carrying the further substituent R4 in 5-position, isoxazolyl, especially 3-isoxazolyl
carrying the further substituent R4 in 5-position, or 5-isoxazolyl, carrying the further
substituent R4 in 3-position, or thienyl, especially 2-thienyl, carrying the further substituent R4
in 3-, 4- or 5-position, preferably 4-position, or 3-thienyl, carrying the further substituent R4 in
4-position.

Most preferred is the heteroaryl group R3 as triazolyl, carrying the substituent R4 in 4- or
5-position, and also R3 as 2-thienyl carrying the substituent R4 in 4- or 5-position.

An optionally substituted unsaturated alkyl group R3 is 1-alkenyl carrying the further
substituent R4 in 1- or 2-position, preferably in 2-position, or 1-alkynyl. Substituents
considered in 1-alkenyl are e.g. lower alkyl, e.g. methyl, lower alkoxy, e.g. methoxy, lower
acyloxy, e.g. acetoxy, or halogenyl, e.g. chloro. In a particularly preferred embodiment of the
invention R3 is 1-alkynyl.


An optionally substituted unsaturated cycloalkyl group is a cycloalkyl group with 3 to 7 carbon 
atoms unsaturated in 1-position, e.g. 1-cyclopentyl or 1-cyclohexyl, carrying the further 
substituent R4 in any position. Substituents considered are e.g. lower alkyl, e.g. methyl, 
lower alkoxy, e.g. methoxy, lower acyloxy, e.g. acetoxy, or halogenyl, e.g. chloro. 


An optionally substituted unsaturated heterocyclyl group has 3 to 12 atoms, 1 to 5
heteroatoms selected from nitrogen, oxygen and sulfur, and a double bond in the position
connecting the heterocyclyl group to methylene CH2. Substituents considered are e.g. lower
alkyl, e.g. methyl, lower alkoxy, e.g. methoxy, lower acyloxy, e.g. acetoxy, or halogenyl, e.g.
chloro.

In particular, an optionally substituted unsaturated heterocyclyl group is a partially saturated
heteroaromatic group as defined hereinbefore for a heteroaromatic group R3. An example of
such a heterocyclyl group is isoxazolidinyl, especially 3-isoxazolidinyl carrying the further
substituent in 5-position, or 5-isoxazolidinyl, carrying the further substituent in 3-position.

A linker group R4 is preferably a flexible linker connecting a label L or a plurality of same or
different labels L to the substrate. Linker units are chosen in the context of the envisioned
application, i.e. in the transfer of the substrate to a fusion protein comprising AGT. They also
increase the solubility of the substrate in the appropriate solvent. The linkers used are
chemically stable under the conditions of the actual application. The linker does not interfere
with the reaction with AGT nor with the detection of the label L, but may be constructed such
as to be cleaved at some point in time after the reaction of the compound of formula 1 with
the fusion protein comprising AGT.

A linker R4 is a straight or branched chain alkylene group with 1 to 300 carbon atoms,
wherein optionally

(a) one or more carbon atoms are replaced by oxygen, in particular wherein every third
carbon atom is replaced by oxygen, e.g. a polyethyleneoxy group with 1 to 100 ethyleneoxy
units;

(b) one or more carbon atoms are replaced by nitrogen carrying a hydrogen atom, and the
adjacent carbon atoms are substituted by oxo, representing an amide function -NH-CO-;

(c) one or more carbon atoms are replaced by oxygen, and the adjacent carbon atoms are
substituted by oxo, representing an ester function -O-CO-;

(d) the bond between two adjacent carbon atoms is a double or a triple bond, representing a
function -CH=CH- or -C≡C-;

(e) one or more carbon atoms are replaced by a phenylene, a saturated or unsaturated
cycloalkylene, a saturated or unsaturated bicycloalkylene, a bridging heteroaromatic or a
bridging saturated or unsaturated heterocyclyl group;

(f) two adjacent carbon atoms are replaced by a disulfide linkage -S-S-;

or a combination of two or more, especially two or three, alkylene and/or modified alkylene
groups as defined under (a) to (f) hereinbefore, optionally containing substituents.

Substituents considered are e.g. lower alkyl, e.g. methyl, lower alkoxy, e.g. methoxy, lower
acyloxy, e.g. acetoxy, or halogenyl, e.g. chloro.
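
The relation between the chain length limit of the linker (1 to 300 chain positions) and option (a) above (a polyethyleneoxy group with 1 to 100 ethyleneoxy units, in which every third chain atom is oxygen) can be checked with a short Python sketch. The function name and the representation of the backbone as a list of element symbols are hypothetical, and the sketch counts backbone atoms only, ignoring hydrogens and the attachment points to R3 and L.

```python
def polyethyleneoxy_backbone(units: int) -> list[str]:
    """Backbone atoms of -(CH2-CH2-O)n- as element symbols, hydrogens omitted."""
    if not 1 <= units <= 100:
        raise ValueError("option (a) describes 1 to 100 ethyleneoxy units")
    return ["C", "C", "O"] * units


def composition(units: int) -> dict[str, int]:
    """Count chain positions, carbon atoms and oxygen atoms of the backbone."""
    backbone = polyethyleneoxy_backbone(units)
    return {
        "positions": len(backbone),
        "carbon": backbone.count("C"),
        "oxygen": backbone.count("O"),
    }


# The longest chain under option (a): 100 units occupy 300 chain positions
longest = composition(100)
# Every third backbone position is oxygen
every_third_is_oxygen = all(atom == "O" for atom in polyethyleneoxy_backbone(4)[2::3])
```

On this reading, the upper limit of 100 ethyleneoxy units fills exactly the 300 chain positions allowed for the parent alkylene group.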

Further substituents considered are e.g. those obtained when an α-amino acid, in particular a
naturally occurring α-amino acid, is incorporated in the linker R4 wherein carbon atoms are
replaced by amide functions -NH-CO- as defined under (b). In such a linker, part of the
carbon chain of the alkylene group R4 is replaced by a group -(NH-CHR-CO)n- wherein n is
between 1 and 100 and R represents a varying residue of an α-amino acid.

A further substituent is one which leads to a photocleavable linker R4, e.g. an o-nitrophenyl
group. In particular this substituent o-nitrophenyl is located at a carbon atom adjacent to an
amide bond, e.g. in a group -NH-CO-CH2-CH(o-nitrophenyl)-NH-CO-, or as a substituent in a
polyethylene glycol chain, e.g. in a group -O-CH2-CH(o-nitrophenyl)-O-. Other
photocleavable linkers considered are, e.g. phenacyl, alkoxybenzoin, benzylthioether and pivaloyl
glycol derivatives.

A phenylene group replacing carbon atoms as defined under (e) hereinbefore is e.g. 1,2-,
1,3-, or preferably 1,4-phenylene. In a particular embodiment, the phenylene group is further
substituted by a nitro group, and, combined with other replacements as mentioned above
under (a), (b), (c), (d), and (f), represents a photocleavable group, and is e.g.
4-nitro-1,3-phenylene, such as in -CO-NH-CH2-4-nitro-1,3-phenylene-CH(CH3)-O-CO-, or
2-methoxy-5-nitro-1,4-phenylene, such as in -CH2-O-2-methoxy-5-nitro-1,4-phenylene-CH(CH3)-O-.
Other particular embodiments representing photocleavable linkers are e.g.
-1,4-phenylene-CO-CH2-O-CO-CH2- (a phenacyl group), -1,4-phenylene-CH(OR)-CO-1,4-phenylene-
(an alkoxybenzoin), or -3,5-dimethoxy-1,4-phenylene-CH2-O- (a dimethoxybenzyl moiety).

A saturated or unsaturated cycloalkylene group replacing carbon atoms as defined under (e)
hereinbefore is derived from cycloalkyl with 3 to 7 carbon atoms, preferably from cyclopentyl
or cyclohexyl, and is e.g. 1,2- or 1,3-cyclopentylene, 1,2-, 1,3-, or preferably
1,4-cyclohexylene, or also 1,4-cyclohexylene being unsaturated e.g. in 1- or in 2-position.

A saturated or unsaturated bicycloalkylene group replacing carbon atoms as defined under (e)
hereinbefore is derived from bicycloalkyl with 7 or 8 carbon atoms, and is e.g.
bicyclo[2.2.1]heptylene or bicyclo[2.2.2]octylene, preferably 1,4-bicyclo[2.2.1]heptylene optionally
unsaturated in 2-position or doubly unsaturated in 2- and 5-position, and
1,4-bicyclo[2.2.2]octylene optionally unsaturated in 2-position or doubly unsaturated in 2- and 5-position.

A bridging heteroaromatic group replacing carbon atoms as defined under (e) hereinbefore is
e.g. triazolidene, preferably 1,4-triazolidene, or isoxazolidene, preferably 3,5-isoxazolidene.

A bridging saturated or unsaturated heterocyclyl group replacing carbon atoms as defined
under (e) hereinbefore is e.g. derived from an unsaturated heterocyclyl group as defined
under R3 above, e.g. isoxazolidinene, preferably 3,5-isoxazolidinene, or a fully saturated
heterocyclyl group with 3 to 12 atoms, 1 to 3 of which are heteroatoms selected from
nitrogen, oxygen and sulfur, e.g. pyrrolidinediyl, piperidinediyl, tetrahydrofuranediyl,
dioxanediyl, morpholinediyl or tetrahydrothiophenediyl, preferably 2,5-tetrahydrofuranediyl or
2,5-dioxanediyl. A particular heterocyclyl group considered is a saccharide moiety, e.g. an
α- or β-furanosyl or α- or β-pyranosyl moiety.

Cyclic substructures in a linker R4 reduce the molecular flexibility as measured by the
number of rotatable bonds within R4, which leads to a better membrane permeation rate,
important for all in vivo labelling applications.

A linker R4 is preferably a straight chain alkylene group with 1 to 25 carbon atoms or a
straight chain polyethylene glycol group with 4 to 100 ethyleneoxy units, optionally attached
to the group R3 by a -CH=CH- or -C≡C- group. Further preferred is a straight chain alkylene
group with 1 to 25 carbon atoms wherein carbon atoms are optionally replaced by an amide
function -NH-CO-, and carrying a photocleavable subunit, e.g. o-nitrophenyl. Further
preferred are branched linkers comprising a polyethylene glycol group of 3 to 6 ethylene
glycol units and alkylene groups wherein carbon atoms are replaced by amide bonds, and
further carrying substituted amino and hydroxy functions. Other preferred branched linkers
have dendritic (tree-like) structures wherein amine, carboxamide and/or ether functions
replace carbon atoms of an alkylene group.

A particularly preferred linker R4 is a straight chain alkylene group of 10 to 40 carbon atoms
wherein 3 to 12 carbon atoms are replaced by oxygen, one or two carbon atoms are
replaced by one or two 1,4-triazolidene units, respectively, and optionally one carbon atom is
replaced by a 1,4-phenylene unit.

Another particularly preferred linker R4 is a straight chain alkylene group of 10 to 40 carbon
atoms optionally substituted by oxo wherein 3 to 12 carbon atoms are replaced by oxygen
and one or two carbon atoms are replaced by nitrogen.

Another particularly preferred linker R4 is a straight chain alkylene group of 6 to 40 carbon
atoms wherein 2 to 12 carbon atoms are replaced by oxygen and one or two bonds between
two adjacent carbon atoms are double bonds, each representing a function -CH=CH-.

A linker R4 may carry one or more same or different labels, e.g. 1 to 100 same or different
labels, in particular 1 to 5, preferably one, two or three, in particular one or two same or
different labels.

The label part L of the substrate can be chosen by those skilled in the art dependent on the
application for which the fusion protein is intended. Labels may be e.g. such that the labelled
fusion protein is easily detected or separated from its environment. Other labels considered
are those which are capable of sensing and inducing changes in the environment of the
labelled fusion protein and/or labels which aid in manipulating the fusion protein by the
physical and/or chemical properties specifically introduced by the label to the fusion protein.

Examples of labels L include a spectroscopic probe such as a fluorophore, a chromophore, a
magnetic probe or a contrast reagent; a radioactively labelled molecule; a molecule which is
one part of a specific binding pair which is capable of specifically binding to a partner; a
molecule that is suspected to interact with other biomolecules; a library of molecules that are
suspected to interact with other biomolecules; a molecule which is capable of crosslinking to
other molecules; a molecule which is capable of generating hydroxyl radicals upon exposure
to H2O2 and ascorbate, such as a tethered metal-chelate; a molecule which is capable of
generating reactive radicals upon irradiation with light, such as malachite green; a molecule
covalently attached to a solid support, where the support may be a glass slide, a microtiter
plate or any polymer known to those proficient in the art; a nucleic acid or a derivative thereof
capable of undergoing base-pairing with its complementary strand; a lipid or other
hydrophobic molecule with membrane-inserting properties; a biomolecule with desirable
enzymatic, chemical or physical properties; or a molecule possessing a combination of any
of the properties listed above. Preferred are labels L as mentioned hereinbefore with the
exception of a radioactively labelled molecule. Most preferred as labels L are spectroscopic
probes, and molecules which are one part of a specific binding pair which is capable of
specifically binding to a partner, so-called affinity labels.

When the label L is a fluorophore, a chromophore, a magnetic label, a radioactive label or
the like, detection is by standard means adapted to the label and to whether the method is used
in vitro or in vivo. The method can be compared to the applications of the green fluorescent
protein (GFP) which is genetically fused to a protein of interest and allows protein
investigation in the living cell. Particular examples of labels L are also boron compounds
displaying non-linear optical properties, or a member of a FRET pair which changes its
spectroscopic properties on reaction of the labelled substrate with the AGT fusion protein.



WO 2004/031405 PCT/EP2003/010889


Depending on the properties of the label L, the fusion protein comprising protein of interest
and AGT may be bound to a solid support. The label of the substrate reacting with the fusion
protein comprising AGT may already be attached to a solid support when entering into
reaction with AGT, or may subsequently, i.e. after transfer to AGT, be used to attach the
AGT fusion protein to a solid support. The label may be one member of a specific binding
pair, the other member of which is attached or attachable to the solid support, either
covalently or by any other means. A specific binding pair considered is e.g. biotin and avidin
or streptavidin. Either member of the binding pair may be the label L of the substrate, the
other being attached to the solid support. Further examples of labels allowing convenient
binding to a solid support are e.g. maltose binding protein, glycoproteins, FLAG tags, or
reactive substituents allowing chemoselective reaction between such a substituent and a
complementary functional group on the surface of the solid support. Examples of such pairs
of reactive substituents and complementary functional groups are e.g. amine and activated
carboxy group forming an amide, azide and a propiolic acid derivative undergoing a
1,3-dipolar cycloaddition reaction, amine and another amine functional group reacting with an
added bifunctional linker reagent of the type of activated bis-dicarboxylic acid derivative
giving rise to two amide bonds, or other combinations known in the art.

Examples of a convenient solid support are e.g. glass surfaces such as glass slides,
microtiter plates, and suitable sensor elements, in particular functionalized polymers (e.g. in
the form of beads), chemically modified oxidic surfaces, e.g. silicon dioxide, tantalum
pentoxide or titanium dioxide, or also chemically modified metal surfaces, e.g. noble metal
surfaces such as gold or silver surfaces. Irreversibly attaching and/or spotting AGT
substrates may then be used to attach AGT fusion proteins in a spatially resolved manner,
particularly through spotting, on the solid support representing protein microarrays, DNA
microarrays or arrays of small molecules.

When the label L is capable of generating reactive radicals, such as hydroxyl radicals, upon
exposure to an external stimulus, the generated radicals can then inactivate the AGT fusion
proteins as well as those proteins that are in close proximity to the AGT fusion protein,
allowing the role of these proteins to be studied. Examples of such labels are tethered
metal-chelate complexes that produce hydroxyl radicals upon exposure to H2O2 and ascorbate, and
chromophores such as malachite green that produce hydroxyl radicals upon laser irradiation.
The use of chromophores and lasers to generate hydroxyl radicals is also known in the art as
chromophore assisted laser induced inactivation (CALI). In the present invention, labelling
AGT fusion proteins with chromophores such as malachite green and subsequent laser
irradiation inactivates the AGT fusion protein as well as those proteins that interact with the
AGT fusion protein in a time-controlled and spatially-resolved manner. This method can be
applied both in vivo and in vitro. Furthermore, proteins which are in close proximity to the AGT
fusion protein can be identified as such by detecting fragments of that protein with a
specific antibody, by the disappearance of those proteins on high-resolution 2D
electrophoresis gels, or by identification of the cleaved protein fragments via separation and
sequencing techniques such as mass spectrometry or protein sequencing by N-terminal
degradation.

When the label L is a molecule that can cross-link to other proteins, e.g. a molecule
containing functional groups such as maleimides, active esters or azides and others known
to those proficient in the art, contacting such labelled AGT substrates with AGT fusion
proteins that interact with other proteins (in vivo or in vitro) leads to the covalent cross-linking
of the AGT fusion protein with its interacting protein via the label. This allows the
identification of the protein interacting with the AGT fusion protein. Labels L for photo
cross-linking are e.g. benzophenones. In a special aspect of cross-linking the label L is a molecule
which is itself an AGT substrate leading to dimerization of the AGT fusion protein. The
chemical structure of such dimers may be either symmetrical (homodimers) or unsymmetrical
(heterodimers).

Other labels L considered are for example fullerenes, boranes for neutron capture treatment,
nucleotides or oligonucleotides, e.g. for self-addressing chips, peptide nucleic acids, and
metal chelates, e.g. platinum chelates that bind specifically to DNA.

A particular biomolecule with desirable enzymatic, chemical or physical properties is
methotrexate. Methotrexate is a tight-binding inhibitor of the enzyme dihydrofolate reductase
(DHFR). Compounds of formula 1 wherein L is methotrexate belong to the well known class
of so-called "chemical inducers of dimerization" (CIDs). When a fusion protein of hAGT with
the DNA-binding domain LexA is labelled in vivo with a compound of formula 1 wherein
L is methotrexate, and a fusion protein of DHFR with the transcriptional activation domain
B42 is also present, the hAGT-LexA fusion protein and the DHFR-B42 fusion protein are
coupled ("dimerized"). This brings LexA and B42 into spatial proximity and subsequently
stimulates transcription, as shown in Figure 4.

If the substrate carries two or more labels, these labels may be identical or different.
Particularly preferred combinations are two different affinity labels, or one affinity label and one
chromophore label, in particular one affinity label and one fluorophore label.

The present invention further provides a method to label AGT fusion proteins both in vivo and
in vitro. The term in vivo labelling of an AGT fusion protein includes labelling in all
compartments of a cell as well as of AGT fusion proteins pointing to the extracellular space.
If the labelling of the AGT fusion protein is done in vivo and the protein fused to the AGT is a
membrane protein, more specifically a plasma membrane protein, the AGT part of the fusion
protein can be attached to either side of the membrane, e.g. attached to the cytoplasmic or
the extracellular side of the plasma membrane.

If the labelling is done in vitro, the labelling of the fusion protein can be performed either in
cell extracts or with purified or enriched forms of the AGT fusion protein.

If the labelling is done in vivo or in cell extracts, the labelling of the endogenous AGT of the
host is advantageously taken into account. If the endogenous AGT of the host does not
accept O6-alkylguanine derivatives or related compounds as a substrate, the labelling of the
fusion protein is specific. In mammalian cells, e.g. in human, murine, or rat cells, labelling of
endogenous AGT is possible. In those experiments where the simultaneous labelling of the
endogenous AGT as well as of the AGT fusion protein poses a problem, known
AGT-deficient cell lines can be used.

In a particular aspect, the present invention provides a method of determining the interaction
of a candidate compound or library of candidate compounds and a target protein or library of
target proteins. Examples of candidate compounds and target proteins include ligands and
proteins, drugs and targets of the drug, or small molecules and proteins. In this particular
method of the invention, the protein of interest fused to the AGT comprises a DNA binding
domain of a transcription factor or an activation domain of a transcription factor. The putative
protein target of the substances or library of proteins is linked to either the DNA binding
domain or the activation domain of the transcription factor in such a way that a functional
transcription factor can be formed, and the label L of the AGT substrate according to the
invention is a candidate compound or library of candidate compounds suspected of interacting
with the target substance or substances. The candidate compound or library of candidate
compounds being part of the substrate is then transferred to the AGT fusion protein. On
transfer, the AGT fusion protein(s) comprising the target substance(s) are labelled with
the candidate compound(s). The interaction of a candidate compound joined to the AGT
fusion protein with the target protein fused to either the DNA binding domain or the activation
domain leads to the formation of a functional transcription factor. The activated transcription
factor can then drive the expression of a reporter which, if the method is carried out in cells,
can be detected if the expression of the reporter confers a selective advantage on the cells.

In particular embodiments, the method may involve one or more further steps such as
detecting, isolating, identifying or characterising the candidate compound(s) or target
substance(s).

In a specific example the label L is a drug or a biologically active small molecule that binds to an
as yet unidentified protein Y. A cDNA library of the organism which is expected to express the
unknown target protein Y is fused to the activation domain of a transcription factor, and the
AGT is fused to the DNA binding domain of a transcription factor. Adding the AGT substrate
of the invention comprising such a label L leads to the formation of a functional transcription
factor and gene expression only in the case where this molecule binds to its target protein Y
present in the cDNA library and fused to the activation domain. If gene expression is
coupled to a selective advantage, the corresponding host carrying the plasmid with the gene
coding for the target protein Y of the drug or bioactive molecule can be identified.

In a further specific example the label L is a library of chemical molecules. The library is
expected to contain as yet unidentified compounds that bind to a known drug target protein Y
under in vivo conditions. The target protein Y is fused to the activation domain of a
transcription factor and the AGT is fused to the DNA binding domain of a transcription factor.
Adding the substrate carrying the library of chemical compounds leads to the formation of a
functional transcription factor and gene expression only in the case where the label (i.e. a
compound in the chemical library) binds to its target protein Y fused to the activation domain.
If gene expression is coupled to a selective advantage, those molecules of the library leading
to the growth of the host can be identified.
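
The selection logic of the two specific examples above can be illustrated with a toy model. This is only a sketch under stated assumptions: the function names, the compound names and the set of binders are hypothetical and are not taken from the specification. It shows that the reporter is switched on only when the label L has been transferred to the AGT fusion protein and also binds target protein Y.

```python
def reporter_expressed(agt_labelled: bool, label_binds_target: bool) -> bool:
    """Toy rule: a functional transcription factor forms only if the label L
    sits on the AGT/DNA-binding-domain fusion AND binds target protein Y
    fused to the activation domain."""
    return agt_labelled and label_binds_target


def select_hits(library, binders):
    """Return the library members that would confer growth in this toy model.

    library: iterable of hypothetical compound names used as label L
    binders: set of names assumed (for illustration) to bind protein Y
    """
    return [
        compound
        for compound in library
        if reporter_expressed(agt_labelled=True,
                              label_binds_target=compound in binders)
    ]


# Hypothetical inputs, for illustration only.
library = ["compound_A", "compound_B", "compound_C"]
binders = {"compound_B"}
hits = select_hits(library, binders)
print(hits)
```

In this model a compound that binds protein Y but is never transferred to the AGT fusion protein gives no reporter signal, which mirrors the requirement that both halves of the transcription factor be brought together through the label.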

In the case where L is a bond connecting R4 to R1 forming a cyclic substrate, a preferred
compound is the cyclic substrate wherein the bond from R4 to R1 is a bond connecting the
linker R4 to an amino group R6 as defined under formula 2. In such a preferred cyclic
substrate, R2 is preferably an oligonucleotide, i.e. a β-D-2'-deoxyribosyl being incorporated
into a single stranded oligodeoxyribonucleotide having a length of 2 to 99 nucleotides as
detailed above. This oligonucleotide may be further chemically modified so that it can be
detected and functions therefore as a label. The chemical modification of substituents might
be of the same nature as mentioned above for the label L.

In the case where L is a further group -R3-CH2-X-R1-R2, the substrate is a dimeric compound
leading to a dimerised fusion protein on reaction with a fusion protein comprising AGT. In
the subunit L as a residue -R3-CH2-X-R1-R2, the meaning of R1, R2, R3 and X may be
identical with the corresponding meaning in the other group R2-R1-X-CH2-R3-, representing a
homodimer, or different, representing a heterodimer.

Most preferred are compounds of formula 1 shown in the Schemes and described in the
Examples.

Methods of manufacture of novel substrates and intermediates are also an object of this
invention. These methods are generally known in the art, are chosen so as to best produce the
preferred substrates of the invention, and are exemplified hereinbelow. In particular, the
invention concerns a method wherein a compound of the formula R2-R1-Y, wherein Y is a
reactive leaving group, e.g. an ammonium salt, is treated with a compound of formula
HX-CH2-R3-R4-L or a precursor thereof, and the resulting compound is further manipulated,
e.g. by elaboration of the linker R4 and/or the introduction of a label L. Alternatively, a
compound of the formula R2-R1-XH is alkylated with a compound of the formula
Y-CH2-R3-R4-L, wherein Y is a leaving group, e.g. halogen, or a precursor thereof, and the resulting
compound is further manipulated, e.g. by elaboration of the residue R3 and/or the linker R4,
and/or the introduction of a label L.

In particular, the invention concerns methods of manufacture of novel substrates carrying
more than one label. Thus the synthesis proceeds through a core compound of the formula
R2-R1-X-CH2-R3-R4 wherein R4 is a polyfunctional residue having two or more reactive
nucleophilic or electrophilic groups. Examples of such groups include hydroxy, amino,
sulfhydryl, halo or carboxy. Further examples of reactive groups are Michael acceptors such
as maleimides or vinylsulfones. In addition, functional groups which can take part in
cycloaddition reactions are also included, such as dienes or dienophiles, azides, nitrones,
nitrile oxides, acetylenes, and alkenes.

The specific functional groups which can be placed on the residue R4, in particular at the
outer sphere of a branched residue, for the attachment of suitable labels are limited only by
the requirement that they are not supposed to be reactive with each other or with any
compound or reagent used to build the substrate of the invention. Thus their selection will
depend upon the specific reagents chosen to build the desired substrates. Examples of
functional groups which can be provided on the outer sphere of a branched residue R4
include fluoro, chloro, bromo, cyano, nitro, amino, alkylcarbonylamino, arylcarbonylamino,
alkoxycarbonylamino, aryloxycarbonylamino, carboxy, carbamoyl, alkoxycarbonyl,
aryloxycarbonyl, carbaldehyde, hydroxy, alkoxy, aryloxy, alkylcarbonyloxy, arylcarbonyloxy, a
carbon-carbon double bond, a carbon-carbon triple bond, and the like. Most preferable
examples include amino, hydroxy, cyano (a latent carboxy function), carbamoyl,
carbaldehyde, or a carbon-carbon double bond.

In particular, the preferred synthesis of a branched residue R4 carrying a plurality of labels L
makes use of orthogonally protected functional groups. Such a choice of protective groups
allows for a separate deprotection so that each released functionality in turn can be further
chemically manipulated either to attach a label to it or for the introduction of further extension
of linker R4. Appropriate protecting groups for the envisioned functionalities can be chosen
by those skilled in the art, and are e.g. summarized in T.W. Greene and P.G.M. Wuts,
"Protective Groups in Organic Synthesis", John Wiley & Sons, New York 1991.

Preferred intermediates according to the invention are compounds of formula 1 wherein
R1-R2 is a radical of formula 2 wherein R2 is hydrogen, R5 is hydrogen and R6 is
unsubstituted amino; and X is oxygen; R3 is triazolyl, tetrazolyl, isoxazolyl, thienyl,
isoxazolidinyl or alkynyl; R4 is a linker; and L is amino or azido, in particular amino.

Other preferred intermediates according to the invention are compounds of formula 1
wherein R1-R2 is a radical of formula 2 wherein R2 is hydrogen, R5 is hydrogen and R6 is
unsubstituted amino; and X is oxygen; R3 is 1,4-phenylene; R4 is a straight chain alkylene
group of 10 to 40 carbon atoms optionally substituted by oxo wherein up to 12 carbon atoms
are replaced by oxygen and zero, one or two carbon atoms are replaced by nitrogen; and L
is amino or azido, in particular amino.

Most preferred are the intermediates described in the Schemes and Examples. 


The synthesis of an intermediate useful in the synthesis of compounds of formula 1 wherein
R3 is a tetrazolyl group, an isoxazolyl group or an isoxazolidinyl group is summarized in
Schemes 1 and 2.

The azido compound 7 is prepared from commercially available tetraethylene glycol 5 by
mesylation (methanesulfonyl chloride, triethylamine) followed by reaction with sodium azide
in ethanol. 7 is again mesylated and subjected to a Gabriel amine synthesis to give
azido-amine 9 (Carolay et al., J. Org. Chem. 56: 4326-4329, 1991). The Cu(I)-catalyzed
1,3-dipolar cycloaddition between azide 9 and the acetylene derivative 10 (Griffin et al., J. Med.
Chem. 43: 4071-4083, 2000) yields the 1,4-substituted triazole 11. Alternatively the azide 9
and the cyano derivative 12 react under Lewis acid catalysis (ZnBr2) to form tetrazole 13
(Scheme 1).

Azide 7 is transformed to the central building block, the aldehyde 14, by means of a Swern
oxidation (oxalyl chloride, DMSO, triethylamine). The reaction of 14 with a hydroxylamine
derivative yields the nitrone 17, which upon reaction with the acetylene derivative 10 forms
the class of isoxazolidines 18. From aldehyde 14, the oxime 15 is formed as an equimolar
mixture of isomers. The corresponding nitrile oxide is formed in situ by oxidation with sodium
hypochlorite followed by reaction with 10 to yield the isoxazole 16 (Scheme 2).

Scheme 2

The synthesis of an intermediate useful in the synthesis of compounds of formula 1 wherein
R3 is a thienyl group is summarized in Scheme 3.

The commercially available tetraethylene glycol 5 is monofunctionalized through the reaction
with one equivalent of allyl iodide under strongly basic conditions to yield 22, which is further
dimethoxytrityl (DMT)-protected to 23. This intermediate allows the palladium catalyzed
Suzuki coupling with thiophene derivative 21 to the fully protected compound 25.

Monodeprotection of the DMT-group and subsequent mesylation (methanesulfonyl chloride,
triethylamine) followed by the reaction with sodium azide in ethanol gives the protected azide
which is deprotected with HF/pyridine to 26. Coupling of the free hydroxy group with the
activated guanine-cation 27 leads to an azido-intermediate which serves as a precursor for
different functionalization strategies. Finally reduction of the azide to amine 28 allows the
introduction of the label unit L or the coupling to different surfaces.

The synthesis of an intermediate useful in the synthesis of compounds of formula 1 wherein
R3 is a phenylene group is summarized in Scheme 4.



Scheme 4

A compound of formula 1 wherein R1 is guanine, R2 is hydrogen, R3 is 1,4-phenylene, R4 is a
unit composed of two triazolyl-4-methoxy units and a tetraethyleneoxy unit and L is
-R3-CH2-X-R1-R2 is prepared as shown in Scheme 5:

Scheme 5 







Compounds of formula 1 wherein R1 is guanine, R2 is hydrogen, R3 is 1,4-phenylene, R4 is a
tetraethyleneoxy unit further comprising a triazolyl-4-methoxy group and L is
-R3-CH2-X-R1-R2 are prepared as shown in Schemes 6, 7 and 8:

The synthesis of compounds of formula 1 wherein R1 is guanine, R2 is hydrogen, R3 is
1,4-phenylene, R4 is a tetraethyleneoxy unit further comprising a 6-aminocaproyl group and L
represents a combination of two different affinity labels is exemplified in Schemes 9 to 11.



For the tri-orthogonal protecting group strategy, ε-N-Fmoc-lysine 47 is converted to the
corresponding α-azido-ε-N-Fmoc-lysine 48 by the diazo transfer method (Lundquist et al.,
Org. Lett. 3: 781-783, 2001). Condensation of α-azido-ε-N-Fmoc-lysine with an appropriately
functionalized amino-benzylguanine derivative (e.g. 49) under standard peptide coupling
conditions leads to the fully protected molecular scaffold 50. The α-azido group is reduced to
the corresponding amine 51 under neutral conditions with trimethylphosphine in aqueous
1,4-dioxane, giving a compound suitable for condensation with a variety of different building
blocks (containing e.g. affinity labels). For example, coupling with
N-(+)-biotin-6-aminocaproic acid N-succinimidyl ester to give 52 followed by Fmoc-deprotection with diethylamine
yields intermediate 53 with a free amino group which can be used to couple further labels to
the central unit. Digoxigenin-3-O-methylcarbonyl-ε-aminocaproic acid
N-hydroxysuccinimide ester is coupled to the free amino group of 53 leading to the double labelled
substrate 54 (Scheme 9).

Scheme 9

A different orthogonal protection strategy is illustrated in Scheme 10 for an alternative
synthesis of the double labelled substrate 54. Condensation of α-N-Fmoc-ε-N-Dde-lysine 55
with an appropriately functionalized amino-benzylguanine derivative (e.g. 49) under standard
peptide coupling conditions leads to the fully protected molecular scaffold 56.
Fmoc-deprotection with diethylamine followed by coupling of N-(+)-biotin-6-aminocaproic acid
N-succinimidyl ester yields intermediate 58. The remaining Dde-protecting group is cleaved by
using a 4% hydrazine solution, liberating the second amino group to give compound 53 which
can be further derivatized.

Scheme 11




Scheme 11 shows the synthesis of two more examples of benzylguanine derivatives bearing
a different combination of labels (two affinity labels biotin and dinitrophenyl in 59, or an
affinity label combined with a fluorophore in 60). Starting from 53 (see Scheme 9) the
coupling of 2,4-dinitrofluorobenzene to the free amino group yields 59. Coupling of the free
amino group in 53 with fluorescein-5(6)-carboxamidocaproic acid N-succinimidyl ester gives
60.

A further compound of formula 1 wherein R1 is guanine, R2 is hydrogen, R3 is 1,4-phenylene,
R4 is a dimeric tetraethyleneoxy unit coupled through a urea function and L is
-R3-CH2-X-R1-R2 is prepared as shown in Scheme 12:

Scheme 12

The synthesis of a compound of formula 1 wherein R1 is guanine, R2 is hydrogen, R3 is
propargyl, R4 is a tetraethyleneoxy unit further comprising a 6-aminocaproyl group and L is
biotin is shown in Schemes 13 and 14:

Scheme 13

The synthesis of benzylguanine derivatives with an enzyme inhibitor as a label L, acting as
a small molecule to induce protein dimerization in living cells based on the covalent labeling
of fusion proteins with synthetic ligands capable of interacting with other proteins, is shown in
Schemes 15 and 16. In particular, the synthesis of a compound of formula 1 wherein R1 is
guanine, R2 is hydrogen, R3 is 1,4-phenylene, R4 is either a
ω-amino-dodecanoylaminomethyl residue or a tetraethyleneoxy unit further comprising a glutamic acid unit and L is
methotrexate is described. Methotrexate is a tight-binding inhibitor of the enzyme
dihydrofolate reductase, and these heterodimeric structures belong to the well known class
of compounds of so-called "chemical inducers of dimerization" (CIDs).



Compounds of formula 1 wherein R1 is guanine, R2 is hydrogen, R3 is 1,4-phenylene, R4 is a
but-2-en-1,4-diol derivative and L is -R3-CH2-X-R1-R2 are prepared as shown in Schemes 17
and 18.



Scheme 17

It is assumed that the polar solvents necessary to dissolve O6-benzylguanine derivatives are
not compatible with the olefin metathesis catalyst
benzylidene-bis(tricyclohexylphosphine)-dichlororuthenium. Therefore, first the protected symmetric core units 81 and 85 are built
and, after deprotection, the final bis-benzylguanines 82 and 86 are synthesized.

Particularly preferred compounds of the invention are those shown in Schemes 1 to 18, and
compounds described in the Examples.

Examples

Example 1: Preparation of glass slides for the covalent attachment of AGT substrates and
subsequent covalent immobilization of AGT fusion proteins for the preparation of protein
microarrays

A commercially available microscope glass slide (SiO2) is cleaned thoroughly with
dichloromethane, acetone, H2O2/H2SO4 in an ultrasonic bath, and bi-distilled water. It is
aminosilylated using 3-aminopropyltriethoxysilane in a solvent mixture ethanol/water (95:5)
for 1 h following a published procedure, then treated with a solution of disuccinimidyl
glutarate (10 mM) in dichloromethane/diisopropylethylamine (100:1) for 2 h under argon at
room temperature. The surface is washed several times with dichloromethane. The glass
surface bearing activated carboxy functions is incubated for 4 h with a solution of a
compound of formula 1 bearing a free amino group instead of label L, such as compound 11,
13, 16, 18, 28, 34, 49, 68 or 73, in methanol in a concentration of 10 mM, and supplemented
with triethylamine. The slides are washed at least three times with methanol to yield a
surface with the corresponding AGT-substrate covalently attached by an amide bond. To
avoid side reactions in further use of the slides, all unreacted succinimidyl groups are
quenched by addition of 6-aminohexanol (100 mM in DMF).

Example 2: 11-Azido-3,6,9-trioxaundecanol (7) and 1,11-diazido-3,6,9-trioxaundecane (36)
A solution of 50.0 g (260 mmol) tetraethylene glycol and 50 mL triethylamine in 200 mL dry
diethyl ether is cooled to 0°C under an argon atmosphere, and 15.0 g (130 mmol)
methanesulfonyl chloride is added over a 3 h period and stirred at room temperature for 20 min. The
solvent is removed in vacuo, and 300 mL 95% ethanol and 18.0 g (280 mmol) sodium azide
are added. The mixture is heated to reflux for 24 h, cooled to room temperature and
concentrated in vacuo. The remaining mixture is diluted with 400 mL dichloromethane,
washed with brine and dried over MgSO4. After concentration in vacuo the crude mixture of
mono- and diazide is purified by silica gel chromatography (petrol ether/ethyl acetate 3:1)
yielding 15.03 g (68.5 mmol, 26%) monoazide and 3.46 g (14.18 mmol, 5.5%) diazide.
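
The molar amounts quoted in Example 2 can be cross-checked with a short calculation. The sketch below is illustrative only: the molecular formulas (C8H18O5 for tetraethylene glycol, C8H17N3O4 for monoazide 7, C8H16N6O3 for diazide 36) and the rounded atomic weights are assumptions that do not appear in the specification, and the helper names are hypothetical. Yields are computed relative to tetraethylene glycol, which appears to be the basis of the percentages given in the text.

```python
# Rounded standard atomic weights in g/mol (assumed values, not from the source).
ATOMIC_WEIGHT = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}


def molar_mass(formula):
    """Molar mass in g/mol for a composition given as {element: count}."""
    return sum(ATOMIC_WEIGHT[element] * count for element, count in formula.items())


def amount_mmol(mass_g, formula):
    """Amount of substance in mmol for a given mass in grams."""
    return 1000.0 * mass_g / molar_mass(formula)


# Assumed molecular formulas for the compounds named in Example 2.
TETRAETHYLENE_GLYCOL = {"C": 8, "H": 18, "O": 5}
MONOAZIDE_7 = {"C": 8, "H": 17, "N": 3, "O": 4}
DIAZIDE_36 = {"C": 8, "H": 16, "N": 6, "O": 3}

glycol_mmol = amount_mmol(50.0, TETRAETHYLENE_GLYCOL)
mono_mmol = amount_mmol(15.03, MONOAZIDE_7)
di_mmol = amount_mmol(3.46, DIAZIDE_36)

# Percent yields relative to the tetraethylene glycol charged.
mono_yield = 100.0 * mono_mmol / glycol_mmol
di_yield = 100.0 * di_mmol / glycol_mmol

print(f"glycol {glycol_mmol:.1f} mmol, monoazide {mono_mmol:.1f} mmol, diazide {di_mmol:.2f} mmol")
print(f"yields: monoazide {mono_yield:.1f}%, diazide {di_yield:.1f}%")
```

Under these assumptions the computed amounts agree with the rounded figures in the text to within rounding, which supports reading the quantities as mmol rather than as concentrations.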

Example 3: 1-Azido-11-phthalimido-3,6,9-trioxaundecane (8)

A solution of 1.17 g (5.35 mmol) 11-azido-3,6,9-trioxaundecanol (7) and 1.2 mL triethylamine
in 35 mL methylene chloride is cooled to 0°C, and 0.5 mL (6.45 mmol) methanesulfonyl
chloride is added dropwise via a syringe over a 20 min period. The mixture is warmed to
room temperature and stirred for 1.5 h. The mixture is then washed twice with 10 mL of
saturated aqueous NaHCO3 and three times with 5 mL of water. The organic layer is dried
and concentrated in vacuo to yield 1.5 g 8 as a yellow oil which is used without further
purification.

Example 4: 4-Bromothenyl alcohol (20)
5.0 g (26.17 mmol) 4-Bromothiophene-2-carboxaldehyde (19) is dissolved in 75 mL
isopropanol, and 1.11 g (29.31 mmol) NaBH4 is added at once and the mixture stirred for 2 h.
20 mL saturated aqueous NH4Cl is added, the solid removed by filtration and the mixture
concentrated in vacuo. The product is purified by silica gel chromatography (petrol ether/
ethyl acetate 10:1), yielding 4.64 g 20 (24.07 mmol, 92%) as a colorless solid.

Example 5: 4,7,10,13-Tetraoxa-1-pentadecen-15-ol (22)

2.3 g (19.5 mmol) Potassium tert-butoxide is dissolved in 500 mL dry THF, and 7.18 g
(37 mmol) tetraethylene glycol is added dropwise. After stirring for 30 min, a solution of
3.31 g (19.7 mmol) allyl iodide in 60 mL dry THF is added over 1 h, and stirring is continued
for 24 h. The crude mixture is filtered over silica gel and the solvent removed in vacuo. The
product is purified by silica gel chromatography (gradient: petrol ether/ethyl acetate 10:1 ->
ethyl acetate), yielding 2.41 g 22 (10.3 mmol, 27%) as a colorless liquid.

Example 6: 1-(2-Amino-7H-purin-6-yl)-1-methyl-pyrrolidinium chloride (27)
1.0 g (5.9 mmol) 6-Chloroguanine is dissolved in 40 mL DMF at 40°C. After cooling to room
temperature, 1.4 mL 1-methylpyrrolidine (13.2 mmol) is added, and the reaction mixture is
stirred for 18 h. 2 mL of acetone are added to complete precipitation. The solid is filtered,
washed with ether and dried in vacuo, yielding 1.03 g 27 (3.9 mmol, 66%).

Example 7: O6-(4-Aminomethyl-benzyl)guanine (32)

a) 4-Aminomethyl-benzyl alcohol: 2.83 g LiAlH4 (74.5 mmol) are suspended in 150 mL dry
ether, and 1.9 mL H2SO4 (100 %, 37.2 mmol) are added dropwise and under cooling. The
mixture is stirred for 1 h at room temperature, followed by dropwise addition of 2.0 g (12.4
mmol) 4-cyanobenzoate in 12 mL ether. After 2 h of refluxing the reaction is quenched with
20 mL water followed by 7.4 g NaOH in 60 mL water. The organic layer is decanted, and the
aqueous layer extracted with ether and ethyl acetate. The organic layer is dried over MgSO4,
the solvent is removed and the product dried in vacuo, yielding 0.92 g (6.7 mmol, 54%).

b) 2,2,2-Trifluoro-N-(4-hydroxymethyl-benzyl)-acetamide: To a solution of 866 mg (6.3
mmol) 4-aminomethyl-benzyl alcohol and 880 µL (6.3 mmol) triethylamine in 10 mL dry
methanol, 980 µL (8.2 mmol) trifluoroacetic acid ethyl ester are added dropwise. The reaction
mixture is stirred for 45 min, and diluted with 10 mL ethyl acetate and 10 mL water. The
aqueous layer is extracted with ethyl acetate, and the combined organic layers are washed
with saturated NaCl and dried over Na2SO4. After removal of the solvents in vacuo the crude
product is purified by flash column chromatography (ethyl acetate/cyclohexane 1:2). Yield:
1.32 g (5.7 mmol, 90%).

c) N-[4-(2-Amino-7H-purin-6-yloxymethyl)-benzyl]-2,2,2-trifluoroacetamide: 592 mg
(2.54 mmol) 2,2,2-trifluoro-N-(4-hydroxymethyl-benzyl)acetamide are dissolved in dry DMF
under argon atmosphere, and 599 mg (5.33 mmol) potassium tert-butoxide are added. 300
mg (1.18 mmol) 1-(2-amino-7H-purin-6-yl)-1-methylpyrrolidinium chloride (27) is then added
and the solution stirred for 3 h. After removal of the solvent in vacuo the crude product is
purified by flash column chromatography (300 mL methanol/dichloromethane 1:50, 500 mL
methanol/dichloromethane 1:10). Yield: 382 mg (1.04 mmol, 88%).

d) O6-(4-Aminomethyl-benzyl)guanine (32): 335 mg (0.91 mmol) N-[4-(2-amino-7H-purin-6-
yloxymethyl)-benzyl]-2,2,2-trifluoroacetamide is suspended in 34 mL methanol and 2 mL
water. After addition of 656 mg (4.75 mmol) of K2CO3 the reaction mixture is refluxed for 2 h.
The solvents are removed in vacuo and the product is purified by flash column
chromatography (methanol/triethylamine/dichloromethane 1:0.05:5). Yield: 209 mg 32
(0.77 mmol, 85%).
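
For a multi-step route such as steps a) to d) above, a rough overall yield can be estimated by multiplying the individual step yields. The sketch below is an illustration only (the helper `overall_yield` is a hypothetical name). It assumes a strictly linear sequence in which each product is the limiting reagent of the next step, which is a simplification here because step c) is calculated relative to compound 27:

```python
from math import prod


def overall_yield(step_yields_percent):
    """Multiply the fractional yields of a linear sequence; return percent."""
    return 100.0 * prod(y / 100.0 for y in step_yields_percent)


# Step yields quoted in Example 7, steps a) to d).
steps_7a_to_7d = [54, 90, 88, 85]
overall = overall_yield(steps_7a_to_7d)
print(f"{overall:.1f}%")
```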

Example 8: O6-(4-[Prop-2-ynyloxymethyl]-benzyl)guanine (35)

662 mg (3.8 mmol) 4-(Prop-2-ynyloxymethyl)-benzyl alcohol (39) is dissolved in 3 mL dry
DMSO, and 61 mg NaH are added in small portions over 5 min. 300 mg (1.27 mmol) 1-(2-
amino-7H-purin-6-yl)-1-methylpyrrolidinium chloride (27) is added and the mixture is stirred
for an additional 4 h. The reaction is quenched with 0.2 mL of acetic acid, evaporated to
dryness and purified by flash column chromatography (gradient: dichloromethane/methanol
50:1 -> 10:1) to yield 188 mg 35 (0.61 mmol, 53%).

Example 9: Homo-O6-benzylguanine-dimer 37

To a solution of 50.0 mg (0.162 mmol) O6-(4-[prop-2-ynyloxymethyl]-benzyl)guanine (35) and
19.7 mg (0.081 mmol) 1,11-diazido-3,6,9-trioxaundecane (36) in 0.5 mL DMF a suspension
of 15.43 mg (0.081 mmol) Cu2I2 in 0.15 mL of water is added. The mixture is stirred at room
temperature for 24 h.
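
The dimerisation uses two alkyne units per diazide linker, which can be checked from the molar amounts given above. A minimal sketch, assuming only the quantities stated in this example (the function `equivalents` is an illustrative helper):

```python
def equivalents(reagent_mmol: float, reference_mmol: float) -> float:
    """Molar equivalents of a reagent relative to a reference compound."""
    if reference_mmol <= 0:
        raise ValueError("reference_mmol must be positive")
    return reagent_mmol / reference_mmol


# Amounts from Example 9, with diazide 36 taken as the reference.
diazide_36_mmol = 0.081
alkyne_35_eq = equivalents(0.162, diazide_36_mmol)  # alkyne 35
copper_eq = equivalents(0.081, diazide_36_mmol)     # copper iodide as stated
print(alkyne_35_eq, copper_eq)
```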

Example 10: 4-(Prop-2-ynyloxymethyl)benzyl alcohol (39) and
1,4-bis-(prop-2-ynyloxymethyl)benzene (40)

To a solution of 2.5 g (18.1 mmol) 4-hydroxymethylbenzyl alcohol (38), 477.5 mg (19.9 mmol)
NaH is added in small portions over 20 min. 2.15 mL of a propargyl bromide solution (80% in
toluene) is added dropwise, and the mixture is stirred for 15 h. 100 mL of water are added,
and the products extracted with diethyl ether. The combined organic phases are dried and the
solvent removed in vacuo. The separation of the products is achieved by silica gel chromatography
(petrol ether/ethyl acetate 4:1) yielding 1.08 g 39 (6.17 mmol, 34%) and 1.05 g 40
(4.94 mmol, 27%).

Example 11: 4-[(tert-Butyldimethylsilyloxy)methyl]benzyl alcohol (44)

810 mg (33.77 mmol) NaH are suspended in 90 mL dry THF at room temperature, and 4.2 g
(30.39 mmol) solid 4-hydroxymethyl-benzyl alcohol (38) is added in three portions over 5
min, and the reaction mixture is stirred for 45 min. 4.83 g (32.08 mmol) tert-butyldimethylsilyl
chloride are added in three portions over 5 min and stirred for an additional 1.5 h before the
mixture is quenched with water and then diluted with 100 mL of water and 100 mL of diethyl
ether. The organic phase is separated and the aqueous phase is extracted with diethyl
ether. The combined organic phases are washed with brine, dried over MgSO4, filtered and
concentrated in vacuo. The product is purified by flash chromatography (petrol ether/ethyl
acetate 10:1) to yield 3.0 g 44 (11.88 mmol, 40%).

Example 12: 1-[(tert-Butyldimethylsilyloxy)methyl]-4-(iodomethyl)benzene (45)
9.15 g (34.88 mmol) Triphenylphosphine and 3.2 g (44.5 mmol) imidazole are dissolved in a
3:1 mixture of diethyl ether/acetonitrile (30 mL). 8.85 g (34.9 mmol) iodine is added under
vigorous stirring until a yellow suspension has formed. A solution of 6.1 g (23.25 mmol) of
the monoprotected benzyl alcohol 44 in 20 mL of the same solvent mixture is added, and the
mixture is stirred for 2 h. The solid is removed by filtration, the filtrate diluted with 100 mL of
diethyl ether and washed with 100 mL of a saturated solution of sodium bisulfite. The
aqueous solution is back-extracted with diethyl ether, the combined organic phases dried
over MgSO4 and concentrated in vacuo. Flash chromatography (petrol ether/ethyl acetate
95:5) yields 4.8 g 45 (13.25 mmol, 57%).

Example 13: 4-(13-Azido-2,5,8,11-tetraoxatridecyl)-benzyl alcohol (46)

4.8 g (13.25 mmol) 1-[(tert-Butyldimethylsilyloxy)methyl]-4-(iodomethyl)benzene (45) is
dissolved in 70 mL dry THF under argon, and 0.954 g (39.75 mmol) NaH is added in small
portions over 10 min. A solution of 3.2 g (14.58 mmol) 11-azido-3,6,9-trioxaundecanol (7) in
20 mL dry THF is added dropwise, and the reaction mixture stirred for 15 h at room
temperature. 2 mL of water are added to quench the reaction, and the mixture is
concentrated to about 50% under reduced pressure. 70 mL of water are added, and the
reaction mixture extracted with diethyl ether. The organic phase is dried over MgSO4 and the
solvent removed. Purification by silica gel chromatography (gradient: petrol ether/ethyl
acetate 10:1 -> 3:1) yields 3.8 g (8.38 mmol, 63%) of the TBDMS-protected product. It is
dissolved in 80 mL dry THF in a plastic tube, cooled to 0°C, and 8 mL of a pyridine/HF
(70:30) solution is added, and the mixture is stirred for 3 h at room temperature. 100 mL of aqueous
saturated NaHCO3 are added, the organic phase separated, washed with brine and dried
over MgSO4. After removal of the solvent the product is purified by silica gel
chromatography (petrol ether/ethyl acetate 1:1) to yield 1.27 g 46 (2.87 mmol, 74%).

Example 14: O6-[4-(13-Azido-2,5,8,11-tetraoxatridecyl)-benzyl]guanine (41)
0.974 g (2.87 mmol) 4-(13-azido-2,5,8,11-tetraoxatridecyl)-benzyl alcohol (46) is dissolved in
5 mL dry DMF, and 1.3 g (11.5 mmol) potassium tert-butoxide are added. 0.731 g (2.87
mmol) 1-(2-amino-7H-purin-6-yl)-1-methyl-pyrrolidinium chloride is added and the solution
stirred for 22 h. After removal of the solvent in vacuo the crude product is purified by flash
column chromatography (methanol/dichloromethane 5:95). Yield: 0.675 g (50%).

Example 15: Hetero-O6-benzylguanine-dimer 42

To a solution of 45 mg (0.09 mmol) azide 41 and 29.5 mg (0.09 mmol) O6-(4-[prop-2-
ynyloxymethyl]-benzyl)guanine (35) in 1 mL water/acetonitrile 1:1 a suspension of 9.8 mg
Cu2I2 in 0.1 mL water is added, and the reaction mixture is stirred for 24 h at room
temperature. All insoluble material is filtered off, the solvents evaporated in vacuo and the
product purified by flash column chromatography (methanol/dichloromethane 5:1) yielding 3.5
mg (0.004 mmol, 10%). ESI/MS: 781.44 (M+).

Example 16: Homo-O6-benzylguanine-dimer 43

Azide 41 (100 mg, 0.21 mmol) and 1,4-(diprop-2-ynyloxymethyl)benzene (40) (22.6 mg,
0.10 mmol) are dissolved in a solvent mixture consisting of 300 µL ethanol, 200 µL water and
500 µL tert-butanol. 10 µL CuSO4 solution (40 mM), 5 mg Cu-wire and 3 mg Cu2I2 are
added, and the reaction mixture is stirred at room temperature for 48 h. All insoluble material
is filtered off, the solvent evaporated in vacuo, and the product purified by flash column
chromatography (chloroform/methanol 20:1) yielding 34 mg (28%) of dimer 43. ESI/MS:
1159.37 (M+).

Example 17: 11-Amino-1-azido-3,6,9-trioxaundecane (9)

1-Azido-11-phthalimido-3,6,9-trioxaundecane (8, Example 3, 2.0 g, 5.74 mmol) is treated with
1.0 mL 55% hydrazine hydrate in 50 mL abs. methanol and heated to reflux for 2 h. After
cooling to room temperature, the mixture is concentrated in vacuo, diluted with 50 mL of water
and treated with 5 mL conc. HCl. All precipitate is filtered off, the aqueous filtrate neutralized
with NaOH, and the solvent removed in vacuo. The residue is dissolved in dichloromethane,
and washed with 3 N NaOH and water. The organic phase is dried over MgSO4 and
concentrated in vacuo yielding 0.63 g (2.9 mmol, 51%) of a colourless oil, which is used in
the next step without further purification.

Example 18: O6-(1-[11-Amino-3,6,9-… (11)
O6-Propargyl-guanine 10 (75 mg, 0.4 mmol) and azide 9 (44 mg, 2 mmol) are dissolved in a
solvent mixture of 400 µL N,N-dimethylacetamide, 100 µL acetonitrile and 100 µL water.
10 µL of a CuSO4 solution (40 mM), Cu-wire (5 mg) and Cu2I2 (2 mg) are added, and the
reaction mixture stirred at room temperature for 48 h. All insoluble material is filtered off, and
the solvents evaporated in vacuo. Flash chromatography (dichloromethane/methanol 20:1)
yields 35 mg 11 (21%).

Example 19: α-Azido-ε-N-Fmoc-lysine 48

18 mL Triflyl azide (6.3 mmol, 2.3 eq. in dichloromethane) is added to ε-N-Fmoc-lysine 47
(1.10 g, 2.98 mmol) and CuSO4 (8 mg) in 27 mL of a solvent mixture water/methanol 1:2.
Diisopropylethylamine (971 µL, 5.58 mmol) is added, and the reaction mixture is stirred at
room temperature overnight. The organic solvents are removed in vacuo and 100 mL water
is added. The pH is adjusted to 2.5 with 6 N HCl. After extraction with ethyl acetate the
combined organic fractions are washed with water and dried (MgSO4). Evaporation of the
solvent affords the crude product as a yellow oil. Flash chromatography (dichloromethane/
methanol 97:3) gives 670 mg (59%) of a white solid. Rf = 0.32 (dichloromethane/methanol
95:5). ESI-MS (m/z) 395.23 [M+H]+ (MW calcd. = 394.3).
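
The ESI-MS values in these examples are reported as [M+H]+ ions alongside a calculated molecular weight. As a hedged sketch of how the two numbers relate (the helper `mz_protonated` and the rounded proton mass are assumptions for illustration, and the text does not state whether the calculated MW is an average or a monoisotopic mass), the expected m/z is the neutral mass plus one proton:

```python
PROTON_MASS = 1.00728  # mass of a proton in Da, rounded (assumed constant)


def mz_protonated(neutral_mass: float, charge: int = 1) -> float:
    """Expected m/z of an [M + nH]n+ ion for a neutral mass M in Da."""
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return (neutral_mass + charge * PROTON_MASS) / charge


# Example 19: MW calcd. = 394.3, observed [M+H]+ = 395.23.
expected = mz_protonated(394.3)
deviation = 395.23 - expected
print(f"expected {expected:.2f}, deviation {deviation:+.2f}")
```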

Example 20: O6-[4-(13-Amino-2,5,8,11-tetraoxatridecyl)-benzyl]guanine (49)
O6-[4-(13-Azido-2,5,8,11-tetraoxatridecyl)-benzyl]guanine (41, 0.21 g, 0.44 mmol) is
dissolved in 8 mL dry methanol by heating. Triethylamine (410 µL, 2.2 mmol) and 1,3-
propanedithiol (225 µL, 2.2 mmol) are added under an argon atmosphere. The reaction
mixture is heated at 40°C and stirred for 48 h. The solution is decanted, the solid washed
with methanol, and the solvent evaporated in vacuo. The residue is purified by flash column
chromatography (dichloromethane/methanol 95:5), and the title compound obtained as a
pale yellow oil (120 mg, 0.26 mmol, 61%). Rf = 0.01 (dichloromethane/methanol 9:1).

1H-NMR (400 MHz, DMSO-d6) δ 7.83 (s, 1H), 7.47 (m, 2H), 7.36 (m, 2H), 6.27 (s, 2H), 5.48
(s, 2H), 4.50 (s, 2H), 3.55 (m, 14H), 2.64 (t, 2H, J = 5.5 Hz). ESI/MS: 447 [M+H]+.




Example 21: O6-Benzylguanine-PEG-amino-α-azido-ε-N-Fmoc-lysine 50
O6-Benzylguanine-PEG-amine 49 (76 mg, 0.17 mmol) and α-azido-ε-N-Fmoc-lysine 48
(67.2 mg, 0.17 mmol) are dissolved in 0.8 mL dry DMF, and 1-hydroxybenzotriazole (HOBT)
(380 µL, 0.38 mmol, 1 M in 1-methyl-2-pyrrolidone (NMP)) is added. The mixture is stirred at
40°C for 2 h and transferred into 25 mL of diethyl ether to precipitate the crude product. The
residue is dissolved in methanol, adsorbed on SiO2 (0.2 g) and purified by flash column
chromatography (dichloromethane/methanol 10:1) to obtain the O6-benzylguanine-PEG-
amino-α-azido-ε-N-Fmoc-lysine 50 as a pale yellow oil (81 mg, 0.098 mmol, 57%). ESI-MS
(m/z) 823.3 [M+H]+ (MW calcd. = 822.9).

Example 22: O6-Benzylguanine-PEG-amino-ε-N-Fmoc-lysine 51

O6-Benzylguanine-PEG-amino-α-azido-ε-N-Fmoc-lysine 50 (81 mg, 0.098 mmol) is dissolved
in 400 µL 1,4-dioxane / 100 µL water, and 590 µL trimethylphosphine (1 M in THF) are added.
After stirring at room temperature for 2 h, the product is precipitated in 20 mL of diethyl ether
and dried in vacuo overnight. ESI-MS (m/z) 797.35 [M+H]+ (MW calcd. = 796.9). The
product is used without further purification.

Example 23: O6-Benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-ε-N-Fmoc-lysine
52

O6-Benzylguanine-PEG-amino-ε-N-Fmoc-lysine 51 (78.4 mg, 0.098 mmol) and 30 µL
triethylamine are dissolved in 0.5 mL DMF. N-(+)-Biotin-6-aminocaproic acid N-succinimidyl ester
(49.2 mg, 0.1 mmol) is dissolved in 0.5 mL DMF, and both solutions are combined and
stirred at 40°C for 2 h. The reaction mixture is transferred into 30 mL of diethyl ether to
precipitate the crude product. The residue is dissolved in methanol, adsorbed on SiO2
(0.2 g) and purified by flash column chromatography (dichloromethane/methanol 10:1) to
yield 80 mg (0.07 mmol, 71%) of a colorless solid. ESI-MS (m/z) 1136.38 [M+H]+ (MW calcd.
= 1135.38).

Example 24: O6-Benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-lysine 53
O6-Benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-ε-N-Fmoc-lysine 52 (16 mg,
0.014 mmol) and 50 µL diethylamine are dissolved in 0.4 mL DMF and stirred at room
temperature for 1 h. The product is precipitated with 20 mL diethyl ether and dried in vacuo.
ESI-MS (m/z) 914.38 [M+H]+ (MW calcd. = 914.13). The product is used for the next step
without further purification.

Example 25: O6-Benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-ε-N-([digoxigenin-
3-oxymethylcarbonyl]-6-aminocaproyl)-lysine 54

To a solution of O6-benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-lysine 53 (6.93
mg, 0.0075 mmol) and 10 µL triethylamine in 0.4 mL dry DMF, (digoxigenin-3-oxymethylcarbonyl)-
6-aminocaproic acid N-hydroxysuccinimide ester (5 mg, 0.0075 mmol) is added,
and the reaction mixture stirred at 40°C for 2 h. The product is precipitated with 20 mL
diethyl ether, dried in vacuo and purified via flash column chromatography using a gradient of
5 to 15 % methanol in chloroform to yield 5 mg (0.0034 mmol, 45%) of the title product.
ESI-MS (m/z) 1458.78 [M+H]+ (MW calcd. = 1457.82).

Example 26: O6-Benzylguanine-PEG-amino-α-N-Fmoc-ε-N-Dde-lysine 56
O6-Benzylguanine-PEG-amine 49 (50 mg, 0.11 mmol) and α-N-Fmoc-ε-N-Dde-lysine 55 (58
mg, 0.11 mmol) are dissolved in DMF (600 µL). HOBT (280 µL of a 1 M solution in NMP,
0.28 mmol) and 1-ethyl-3-(3-dimethylaminopropyl)carbodiimide (EDC, 54 mg, 0.28 mmol) are
added, and the reaction is stirred for 1 h at room temperature. Ethyl ether is added to the
solution to precipitate the product, and the supernatant is removed via pipette. Residual
solvent is evaporated in vacuo. The product is purified by column chromatography on silica
gel with a gradient of 2 to 5 % methanol in dichloromethane (yield: 64 %).
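
Several reagents in these examples are dosed as a volume of a stock solution, for instance 280 µL of 1 M HOBT for 0.28 mmol. The conversion is simple arithmetic; the sketch below uses a hypothetical helper name, `mmol_from_stock`, purely for illustration:

```python
def mmol_from_stock(volume_uL: float, molarity_M: float) -> float:
    """Amount of substance in mmol delivered by a volume of stock solution.

    Microlitres times mol/L gives micromoles; dividing by 1000 gives mmol.
    """
    if volume_uL < 0 or molarity_M < 0:
        raise ValueError("volume and molarity must be non-negative")
    return volume_uL * molarity_M / 1000.0


# Example 26: 280 µL of a 1 M HOBT solution in NMP.
hobt_mmol = mmol_from_stock(280, 1.0)
print(f"{hobt_mmol:.2f} mmol")
```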

Example 27: O6-Benzylguanine-PEG-amino-ε-N-Dde-lysine 57

O6-Benzylguanine-PEG-amino-α-N-Fmoc-ε-N-Dde-lysine 56 (66 mg, 68.6 µmol) is dissolved
in DMF (600 µL), and diethylamine (71 µL, 686 µmol) is added. The reaction mixture is
stirred for 1 h at room temperature, and the solvent removed in vacuo. The product is used
for the next step without further purification.

Example 28: O6-Benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-ε-N-Dde-lysine 58
O6-Benzylguanine-PEG-amino-ε-N-Dde-lysine 57 and N-(+)-biotinyl-6-aminocaproic acid
N-succinimidyl ester (31 mg, 68.6 µmol) are dissolved in DMF (700 µL). Triethylamine (30 µL)
is added, and the reaction stirred for 2 h. Diethyl ether is added to the solution to precipitate
the product, and the supernatant is removed via pipette. Residual solvent is evaporated in
vacuo. Purification of the product is achieved by column chromatography on silica gel
(gradient of 2 to 3 % methanol in chloroform; yield: 37 % over 2 steps).

Example 29: O6-Benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-lysine 53

O6-Benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-ε-N-Dde-lysine 58 (27 mg,
25.0 µmol) is stirred for 20 min in a solution of 2 % hydrazine in DMF (250 µL). Acetone is
added to the reaction mixture, and the mixture stirred for another 3 min. Ethyl ether is added
to the solution to precipitate the product, and the supernatant is removed via pipette.
Residual solvent is evaporated in vacuo. The product is used for the preparation of
benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-ε-N-([digoxigenin-3-oxymethylcarbonyl]-
6-aminocaproyl)-lysine 54 according to Example 24 without further purification
(yield: 40% over 2 steps).

Example 30: O6-Benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-ε-N-(2,4-
dinitrobenzoyl)-lysine 59

O6-Benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-lysine 53 (16.9 mg, 18.5 µmol)
is dissolved in DMF (400 µL), and 2,4-dinitrofluorobenzene (4.1 mg, 22 µmol) and
triethylamine (40 µL) are added. The reaction mixture is stirred at 45°C for 2 h. Residual solvent is
evaporated in vacuo. The product is purified via flash column chromatography on silica gel
using a stepwise gradient of 0 to 15 % methanol in chloroform (yield: 15 mg, 75 %).

Example 31: O6-Benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-ε-N-(6-
[fluorescein-5-carbonyl]-aminocaproyl)-lysine 60

O6-Benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-lysine 53 (8.5 mg, 0.93 µmol)
and 6-(fluorescein-5-carbonyl)-aminocaproic acid N-hydroxysuccinimidyl ester together with
triethylamine (10 µL) are stirred in 200 µL dimethylacetamide at 40°C for 4 h. The product is
precipitated in 8 mL diethyl ether and dried in vacuo. ESI-MS (m/z) 1385.38 [M+H]+ (MW
calcd. = 1385.13).

Example 32: Bis-N,N'-(O6-benzylguanine-PEG)-urea 61

Triethylamine (15 µL) is added to a solution of O6-[4-(13-amino-2,5,8,11-tetraoxatridecyl)-
benzyl]-guanine (49, 30 mg, 67 µmol) and N,N'-disuccinimidyl carbonate (8.6 mg, 33 µmol) in
0.8 mL dry DMF, and the reaction mixture stirred at room temperature for 24 h. The crude
product is precipitated in diethyl ether (10 mL), adsorbed on SiO2 and purified by flash column
chromatography (methanol/dichloromethane 5:1). Yield: 8.5 mg (9.2 µmol, 27%). ESI/MS:
919.78 [M+H]+ (MW calc. = 918.99).

Example 33: 1-(tert-Butyldimethylsilyloxy)but-2-yn-4-ol (63)

tert-Butyldimethylsilyl chloride (8.0 g, 53 mmol) is dissolved in dry dichloromethane, and
added dropwise to a solution of 2-butyne-1,4-diol (13.71 g, 159.2 mmol, 3 equ.),
4-dimethylaminopyridine (1.29 g, 10.6 mmol, 0.2 equ.) and triethylamine (6.9 mL) in a solvent mixture
of dichloromethane (200 mL) and THF (100 mL). The reaction mixture is stirred at room
temperature for 15 h, washed with water and saturated ammonium chloride and dried over
MgSO4. Flash column chromatography (petrol ether/ethyl acetate 1:1) yields 7.2 g (67%).

Example 34: 1-(tert-Butyldimethylsilyloxy)-4-iodobut-2-yne (64)

1-(tert-Butyldimethylsilyloxy)-but-2-yn-4-ol (63, 4.0 g, 19.96 mmol), imidazole (1.80 g, 26.45
mmol), iodine (6.58 g, 25.95 mmol) and triphenylphosphine (6.80 g, 25.95 mmol) are treated
in 100 mL dichloromethane according to J. Robertson et al., J. Chem. Soc., Perkin Trans. 1,
3389-3369, 2000. Work-up yields 4.42 g (71%) of a colourless oil. The product is used
directly in the next step.



Example 35: 4-(11-Azido-3,6,9-trioxaundecyloxy)-1-(tert-butyldimethylsilyloxy)-2-butyne (65)
11-Azido-3,6,9-trioxaundecanol (7, 3.79 g, 17.02 mmol) is dissolved in dry THF, and sodium
hydride (1.02 g, 42.45 mmol) is added in small portions. After stirring at room temperature
for 40 min, 1-(tert-butyldimethylsilyloxy)-4-iodobut-2-yne (64, 4.4 g, 14.18 mmol) is added,
and the reaction mixture stirred for an additional 15 h. The reaction is quenched with water, and
the amount of solvent reduced in vacuo to about 30%. Diethyl ether (50 mL) is added, the
mixture washed with water, and the organic phase dried over MgSO4. After evaporation of
the solvent, the product is purified by flash column chromatography (petrol ether/ethyl
acetate 4:1) to yield 3.12 g (54%).

Example 36: (13-Azido-2,5,8,11-tetraoxatridecyl)propargyl alcohol (66)
Tert-butyldimethylsilyl ether 65 (1.5 g, 3.37 mmol) is dissolved in 25 mL THF with 0.25 mL
acetic acid, and treated with tetrabutylammonium fluoride (4.1 mL, 1 M in THF). The reaction
mixture is stirred for 2 h, and the amount of solvent reduced in vacuo to about 30%. After
addition of ethyl acetate (50 mL), the organic phase is washed with sat. sodium bicarbonate
(20 mL) and brine (30 mL), and dried over MgSO4. The product is used in the next step
without further purification. Yield: 0.774 g (80%).

Example 37: O6-[(13-Azido-2,5,8,11-tetraoxatridecyl)-propargyl]-guanine (67)
Propargyl alcohol 66 (724 mg, 2.5 mmol) is dissolved in dry DMF (1 mL), and sodium hydride
(181 mg, 7.55 mmol) is added in two portions. After stirring at room temperature for 30 min,
1-(2-amino-7H-purin-6-yl)-1-methyl-pyrrolidinium chloride (27, 641 mg, 2.5 mmol) and 4-
dimethylaminopyridine (100 mg, 0.81 mmol) are added. Stirring is continued for 15 h, and
the product precipitated by the addition of 30 mL diethyl ether. Flash column
chromatography (dichloromethane/methanol 95:5) yields 470 mg (44%) of a slightly yellow
oil. ESI/MS: 421.29 [M+H]+.



Example 38: O6-[(13-Amino-2,5,8,11-tetraoxatridecyl)-propargyl]-guanine (68)

Azide 67 (400 mg, 0.951 mmol) and triphenylphosphine (748 mg, 2.85 mmol) are stirred at
room temperature for 15 h in a solvent mixture of 6 mL 1,4-dioxane / 400 µL water. The
solvent is removed in vacuo, and the residue purified via flash column chromatography
(dichloromethane/methanol 10:1) yielding 312 mg (83%).

Example 39: O6-[(13-[Biotinyl-6-aminocaproyl-amino]-2,5,8,11-tetraoxatridecyl)-propargyl]-
guanine (69)

Propargyl guanine 68 (26.9 mg, 88 µmol) and (+)-biotinyl-6-aminohexanoic acid
N-hydroxysuccinimide ester (20.0 mg, 44 µmol) are stirred with 15 µL triethylamine in 1 mL DMF at
40°C for 3 h. The solvent is removed in vacuo, and the residue purified via flash column
chromatography (dichloromethane/methanol 10:1) yielding 20 mg (68%).

Example 40: O6-[(13-[Digoxigenin-3-oxymethylcarbonyl-6-aminocaproyl-amino]-2,5,8,11-
tetraoxatridecyl)-propargyl]-guanine 70

Propargyl guanine 68 (11.6 mg, 37 µmol), digoxigenin-3-oxymethylcarbonyl-6-aminocaproic
acid N-hydroxysuccinimide ester (5 mg, 7.5 µmol) and 10 µL triethylamine are stirred at 40°C
for 2 h. The product is precipitated with 20 mL diethyl ether, dried in vacuo and purified via
flash column chromatography using a gradient of 5 to 15 % methanol in chloroform to yield
3.2 mg of the title compound (0.0034 mmol, 45%). ESI-MS (m/z) 939.78 [M+H]+ (MW calcd.
= 938.12).

Example 41: O6-[4-(12-Fmoc-amino-dodecanoyl)aminomethyl]benzyl guanine 72
12-Fmoc-aminododecanoic acid (147 mg, 0.33 mmol) and 1-benzotriazolyloxy-tris-
pyrrolidino-phosphonium hexafluorophosphate (PyBOP, 193 mg, 0.37 mmol) are dissolved in
11 mL DMF. After stirring at room temperature for 45 min O6-4-aminomethyl-benzyl guanine
(71, 100 mg, 0.37 mmol) in 5 mL DMF is added. The reaction mixture is stirred at 50°C for
5 min, then at room temperature for 1 h. After evaporation of the solvent in vacuo the
product is purified by flash column chromatography (gradient methanol/dichloromethane from
1:50 to 1:10) to yield 185 mg (0.27 mmol, 79 %) of white solid product. Rf = 0.33 (methanol/
dichloromethane 1:10). 1H NMR (400 MHz, DMSO-d6): δ 1.16-1.50 (m, 16H), 2.11 (t, J=7 Hz,
2H), 2.95 (m, 2H), 2.96-3.09 (m, 2H), 3.58 (m, 1H), 4.20-4.28 (m, 5H), 5.45 (s, 2H), 6.28 (s,
2H), 7.26-7.88 (m, 12H), 8.30 (t, J=6 Hz, 1H). CI-MS 468 [(M-Fmoc)+H]+ (MW calcd. =
689.9).

Example 42: O6-[4-(12-Aminododecanoyl)-aminomethyl]-benzyl guanine (73)
Fmoc-protected derivative 72 (180 mg, 0.26 mmol) is dissolved in 3.2 mL DMF containing
20% (v/v, 6.5 mmol) piperidine. After stirring at room temperature for 30 min the solvent is
removed in vacuo and the product purified by flash column chromatography (gradient:
methanol/dichloromethane from 1:20 to 1:5 containing 1% triethylamine) to obtain 95 mg
(0.20 mmol, 78%) of a pale yellow solid. Rf = 0.05 (methanol/triethylamine/dichloromethane
20:1:100). 1H NMR (400 MHz, DMSO-d6): δ 1.23-1.50 (m, 18H), 2.11 (t, J=7 Hz, 2H), 2.58 (t,
J=7 Hz, 2H), 4.24 (d, J=6 Hz, 2H), 5.45 (s, 2H), 6.27 (s, 2H), 7.35 (dd, J=8 Hz, 77 Hz, 4H),
7.82 (s, 1H), 8.29 (t, J=6 Hz, 1H). ESI-MS (m/z) 468.8 [M+H]+ (MW calcd. = 467.6).

Example 43: L-2-{4-[N-(2,4-Diamino-6-pteridinylmethyl)-methylamino]-benzoylamino}-
pentanedioic acid 1-benzyl ester (76)

L-Glutamic acid 1-benzyl ester (75, 114 mg, 0.48 mmol) and K2CO3 (66.4 mg, 0.48 mmol) are
added to 3 mL DMSO and sonicated at 60°C for 1 h. This mixture is added to a solution of 4-
(N-[2,4-diamino-6-pteridinylmethyl]-methylamino)-benzoic acid 74 (150 mg, 0.46 mmol),
diisopropylethylamine (112 µL, 0.65 mmol) and PyBOP (250 mg, 0.48 mmol) in 1.3 mL DMSO
which had been stirred for 1 h. The reaction mixture is stirred at 50°C for 10 min and then at
room temperature for 2.5 h. After quenching with 0.5 mL 1 M triethylamine the crude product
is purified by HPLC with a linear gradient water/0.1% trifluoroacetic acid (TFA) : acetonitrile/
0.08% TFA from 100:0 to 20:80. The solution is adjusted to pH 7, and the solvents are
removed in vacuo to yield 50 mg (0.09 mmol, 20%) of an orange solid. 1H NMR (400 MHz,
DMSO-d6): δ 1.90-2.05 (m, 2H), 2.30 (m, 2H), 2.78 (m, 1H), 3.21 (d, J=3 Hz, 3H), 3.28 (m,
1H), 4.78 (s, 2H), 5.11 (s, 2H), 6.62 (broad s, 2H), 6.81-7.72 (m, 9H), 8.57 (m, 1H). ESI-MS
(m/z) 545.4 [M+H]+ (MW calcd. = 544.6).
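
The HPLC purification uses a linear gradient from 100:0 to 20:80 (water/0.1% TFA : acetonitrile/0.08% TFA). The text does not give the gradient duration, so the 30 min run time below is purely an assumption for illustration, as is the function name `gradient_composition`. The sketch shows how the eluent composition at a given time follows by linear interpolation:

```python
def gradient_composition(t_min, run_time_min, start=(100.0, 0.0), end=(20.0, 80.0)):
    """Return (% aqueous, % organic) at time t_min of a linear gradient.

    start and end are the compositions at t = 0 and t = run_time_min.
    Times outside the run are clamped to the start or end composition.
    """
    if run_time_min <= 0:
        raise ValueError("run_time_min must be positive")
    frac = min(max(t_min / run_time_min, 0.0), 1.0)
    aqueous = start[0] + frac * (end[0] - start[0])
    organic = start[1] + frac * (end[1] - start[1])
    return aqueous, organic


# Hypothetical 30 min run; the duration is NOT stated in the original text.
for t in (0, 15, 30):
    print(t, gradient_composition(t, 30))
```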

Example 44: O6-4-([12-N-(4-{4-[N-(2,4-Diamino-6-pteridinylmethyl)-methylamino]-benzoyl-
amino}-4-benzyloxycarbonylbutanoyl)-aminododecanoyl]-aminomethyl)-benzyl guanine (77)
(Benzotriazol-1-yloxy)tripyrrolidino-phosphonium hexafluorophosphate (PyBOP, 152 mg,
0.29 mmol) is dissolved in a solution of 1-hydroxybenzotriazole (HOBT) (146 µL of 1 M
solution in 1-methyl-2-pyrrolidone (NMP), 0.15 mmol) and added to benzyl ester 76 (380 µL
of 0.154 M solution in NMP, 0.06 mmol). The mixture is shaken for 30 min and then added
to amino-guanine 73 (27.2 mg, 0.06 mmol) in 200 µL NMP and diisopropylethylamine (30 µL,
0.18 mmol). The reaction mixture is stirred at room temperature for 2 h and then quenched
with 200 µL 1 M TFA. After removal of the solvent in vacuo, the product is purified by flash
column chromatography (gradient methanol/dichloromethane, from 1:50 to 1:5, containing
2% acetic acid). 19 mg (0.02 mmol, 33 %) of an orange solid are obtained. Rf = 0.08
(methanol/acetic acid/dichloromethane 5:1:50). ESI-MS (m/z) 994.9 [M+H]+ (MW calcd. =
994.2).

Example 45: O6-4-([12-N-(4-{4-[N-(2,4-Diamino-6-pteridinylmethyl)-methylamino]-benzoyl-
amino}-4-carboxybutanoyl)-aminododecanoyl]-aminomethyl)-benzyl guanine (78)
Benzyl ester 77 (13 mg, 0.01 mmol) is dissolved in 2 mL methanol/water 1:1, and K2CO3
(785 µL 1 M in methanol/water 1:1, 0.79 mmol) is added. The mixture is shaken at room
temperature for 2.5 h followed by quenching with 785 µL of 1 M TFA. The crude product is
purified by HPLC with a linear gradient water/0.1% TFA : acetonitrile/0.08% TFA from 100:0
to 20:80. 10 mg (0.01 mmol, 85%) product are obtained. ESI-MS (m/z) 904.7 [M+H]+ (MW
calcd. = 904.0).

Example 46: O6-4-[13-N-(4-{4-[N-(2,4-Diamino-6-pteridinylmethyl)-methylamino]-benzoyl-
amino}-4-carboxybutanoyl)-amino-2,5,8,11-tetraoxatridecyl]-benzyl guanine (79)
PyBOP (201 mg, 0.39 mmol) and HOBT (194 µL of a 1 M solution in NMP, 0.19 mmol) are
added to benzyl ester 76 (21 mg, 0.04 mmol), and the mixture shaken at room temperature
for 30 min. O6-[4-(13-Amino-2,5,8,11-tetraoxatridecyl)-benzyl]guanine (49, 21 mg, 0.05
mmol) and diisopropylethylamine (24 µL, 0.14 mmol) are dissolved in 490 µL NMP and
added to the solution of activated 76. After shaking at room temperature for 3 h the reaction
mixture is quenched with 1 mL acetonitrile / 2 mL 1 M TFA, and the crude product purified by
HPLC using a linear gradient water/0.1% TFA : acetonitrile/0.08% TFA from 100:0 to 20:80.
Deprotection of the carboxy group is achieved by adjusting the pH of the solution to 10 with
1.5 M NaOH. After purification of the crude product by HPLC using the same gradient as in
the previous step, 14 mg (0.02 mmol, 40%) of the title compound are obtained. ESI-MS
(m/z) 883.4 [M+H]+ (MW calcd. = 882.9).

Example 47: 4-Allyloxymethyl-1-(tert-butyldimethylsilyloxymethyl)-benzene (80)
To a solution of 4-(tert-butyldimethylsilyloxymethyl)-benzyl alcohol 44 (2.85 g, 11.31 mmol) in
15 mL dry THF, sodium hydride (326 mg, 13.57 mmol) is added in small portions under an
argon atmosphere, and the reaction mixture stirred at room temperature for 30 min. A
solution of allyl iodide (3.80 g, 22.62 mmol) in 5 mL dry THF is added, and the resulting
mixture stirred for 17 h. The reaction is quenched with water, the solvent reduced to about
30% of its volume, and the residue extracted with ethyl acetate (3 times 20 mL). The organic
phase is dried over MgSO4, the solvent removed in vacuo, and the product purified by flash
column chromatography (petrol ether/ethyl acetate 100:1), yield 1.83 g (6.25 mmol, 59%).

Example 48: 1,4-Di-[4-(tert-butyldimethylsilyloxymethyl)-benzyloxy]-2-butene (81)
Benzylidene-bis(tricyclohexylphosphine)-dichlororuthenium (420 mg, 0.513 mmol, 15 mol%)
is added under an argon atmosphere to a solution of 4-allyloxymethyl-1-(tert-butyldimethyl-
silyloxymethyl)-benzene (80, 1.0 g, 3.24 mmol) in 70 mL dry dichloromethane. The reaction
mixture is stirred at room temperature for 15 h, and additional catalyst (200 mg) is added and
the reaction mixture heated to reflux for 5 h. The solvent is removed in vacuo, and the
product purified by column chromatography (petrol ether/ethyl acetate 50:1) to yield 520 mg
(0.93 mmol, 55%) of the title compound.

Example 49: Bis-1,4-(O6-benzylguanine-methoxy)-2-butene 82
TBDMS-dimer 81 (351 mg, 0.63 mmol) is dissolved in 5 mL dry THF, and
tetrabutylammonium fluoride (3.78 mmol) and acetic acid (3.78 mmol) are added. The reaction
mixture is stirred at room temperature for 15 h. After evaporation of the solvent, the product
is purified by column chromatography using a solvent gradient (petrol ether/ethyl acetate
1:1 to 1:3) to yield 150 mg (0.45 mmol, 73%) of a colourless solid. To a solution of this
deprotected dimer (100 mg, 0.30 mmol) in 0.8 mL dry DMF, sodium hydride (44 mg, 1.82
mmol) is added, and the reaction mixture stirred for 30 min. 1-(2-Amino-7H-purin-6-yl)-1-
methyl-pyrrolidinium chloride (27, 186 mg, 0.73 mmol) and DMAP (11 mg, 0.09 mmol) are
added, and the reaction mixture stirred at room temperature for an additional 3 h. The reaction
is quenched with water and the solvent removed in vacuo. The residue is dissolved in
methanol, adsorbed on SiO2 and purified by flash column chromatography (dichloromethane/
methanol 10:1) to obtain dimer 82 as a colourless solid (30 mg, 0.05 mmol, 16%).

Example 50: 11-Allyloxy-3,6,9-trioxaundecanol (83)

To an anhydrous solution of potassium tert-butoxide (2.3 g, 19.5 mmol) in THF (500 mL),
tetraethylene glycol (7.18 g, 37.0 mmol) is added at room temperature. The resulting mixture
is stirred for 30 min, and a solution of allyl iodide (3.31 g, 19.7 mmol) in dry THF (60 mL) is
added dropwise over 1 h. The reaction mixture is stirred for 30 h and filtered through a silica
plug (10 g) using ethyl acetate (800 mL) to elute all product. After evaporation of the solvent
in vacuo, the product is purified by column chromatography using a solvent gradient petrol
ether/ethyl acetate 10:1 to ethyl acetate 100% yielding a colourless oil (2.41 g, 10.3 mmol,
27%).

Example 51: 1-(tert-Butyldimethylsilyloxym…
benzene (84)

To a solution of 11-allyloxy-3,6,9-trioxaundecanol (83, 2.33 g, 9.94 mmol) in dry THF (40
mL), sodium hydride (0.95 g, 39.8 mmol) is added in several portions under an argon
atmosphere. The mixture is stirred for 30 min. A solution of the iodo derivative 45 (3.0 g,
8.28 mmol) in dry THF (10 mL) is added dropwise, and the mixture stirred for 15 h. The
reaction is quenched by adding 2 mL of water, and 70% of the amount of THF is evaporated
in vacuo. To dilute the mixture, 70 mL of water are added, and the solution is extracted 5
times with 50 mL ethyl acetate. The combined organic extracts are dried over MgSO4 and
concentrated in vacuo. The yellow oil obtained is purified by flash column chromatography
(gradient petrol ether/ethyl acetate 10:1 to 5:1) to give the title product 84 as a pale yellow oil
(1.96 g, 4.2 mmol, 51%).

5 

Example 52: TBDMS protected Bis-1,4-(benzyl alcohol-PEG)-2-butene 85

Benzylidene-bis(tricyclohexylphosphine)dichlororuthenium (263 mg, 0.32 mmol, 15 mol%) is
added under an argon atmosphere to a solution of TBDMS protected allyloxy derivative 84
(1.0 g, 2.13 mmol) in 40 mL dry dichloromethane. The reaction mixture is heated to reflux for
5 h. Additional catalyst (200 mg) is added and heating continued for 15 h. The solvent is
removed in vacuo, and the product purified by column chromatography using a solvent
gradient petrol ether/ethyl acetate 1:1 to 1:5, to yield 530 mg (0.58 mmol, 54%) of dimer 85.
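
The 54% figure follows from the stoichiometry of the metathesis, in which two molecules of 84 give one molecule of dimer 85. As a quick check of that arithmetic, the following Python sketch recomputes the yield and the initial catalyst loading from the amounts stated above. The function and variable names are illustrative and not part of the original procedure.

```python
# Sanity check of the amounts quoted for Example 52 (names are illustrative).

def dimer_yield_percent(monomer_mmol: float, dimer_mmol: float) -> float:
    """Percent yield of a homodimerisation: two monomers give one dimer."""
    theoretical_dimer_mmol = monomer_mmol / 2.0
    return 100.0 * dimer_mmol / theoretical_dimer_mmol


def mol_percent(reagent_mmol: float, substrate_mmol: float) -> float:
    """Loading of a reagent relative to the substrate, in mol%."""
    return 100.0 * reagent_mmol / substrate_mmol


yield_85 = dimer_yield_percent(monomer_mmol=2.13, dimer_mmol=0.58)
catalyst_loading = mol_percent(reagent_mmol=0.32, substrate_mmol=2.13)

print(f"yield of dimer 85: {yield_85:.1f} %")
print(f"initial catalyst loading: {catalyst_loading:.1f} mol%")
```

The same helper applies to any homodimerisation step, provided the theoretical amount of dimer is taken as half the amount of monomer.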

Example 53: Bis-1,4-(O6-benzylguanine-PEG)-2-butene 86

TBDMS protected dimer 85 (200 mg, 0.22 mmol) is dissolved in 3 mL dry THF, and
tetrabutylammonium fluoride (2.5 mmol) and acetic acid (2.5 mmol) are added. The reaction
mixture is stirred at room temperature for 15 h. After evaporation of the solvent, the product
is purified by column chromatography using a solvent gradient petrol ether / ethyl acetate 1:1
to ethyl acetate 100% to yield 80 mg (0.11 mmol, 50%) of a colourless solid. To a solution of
this deprotected dimer (61 mg, 0.089 mmol), 1-(2-amino-7H-purin-6-yl)-1-methyl-
pyrrolidinium chloride (27, 54 mg, 0.22 mmol) and DMAP (3.0 mg, 0.027 mmol) in 1.0 mL dry
DMF, sodium hydride (13 mg, 0.54 mmol) is added, and the reaction mixture stirred for 2 h.
The reaction is quenched with water, and the solvent removed in vacuo. The residue is
dissolved in methanol, adsorbed on SiO2 and purified by flash column chromatography
(dichloromethane/methanol 10:1) to obtain dimer 86 as a colourless solid (35 mg, 0.036
mmol, 41%).

Example 54: Demonstration of reactivity of compounds of formula 1 with human O6-alkylguanine-DNA
alkyltransferase (hAGT) fusion proteins

a) Reactivity of O6-(1-[11-amino-3,6,9-trioxaundecyl]-1,2,3-triazolyl-4-methyl)-guanine (11)
18 pmol PGEA hAGT-GST (A. Juillerat et al., Chem. Biol. 10: 313-317, 2003) are incubated with
0.2 µL of a 5 mM solution of 11 in DMSO in a total volume of 15 µL of 50 mM HEPES pH 7.5
for 1 h. 10 pmol of a biotinylated O6-benzylguanine-oligonucleotide (R. Damoiseaux et al.,
ChemBiochem 4: 285-287, 2001) is added, and the reaction is allowed to continue for a further
15 min. As a control, 18 pmol PGEA hAGT-GST are incubated with 10 pmol of the same
biotinylated oligonucleotide in a total volume of 15 µL of 50 mM HEPES pH 7.5 for 15 min
without any substrate. Detection of the biotinylated reaction product between PGEA hAGT-GST
and the biotinylated oligonucleotide is performed by Western Blotting using an anti-biotin
conjugate coupled to HRP (Pierce). Lane "C" of Figure 1A shows the reaction product of
PGEA hAGT-GST with the biotinylated oligonucleotide. If PGEA hAGT-GST is first reacted with
11, no further reaction with biotinylated oligonucleotide is observed (lanes 1 and 2, Figure
1A). This proves that O6-triazolylmethylguanines are efficient substrates for hAGT.
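
To put the amounts in part a) in perspective, the short Python sketch below converts the stated volume and concentration of 11 into a molar amount and compares it with the 18 pmol of fusion protein. It assumes that all volumes are in microlitres; the helper names are illustrative only.

```python
# Amounts in Example 54 a), assuming volumes in microlitres (names illustrative).

def amount_pmol(volume_ul: float, conc_mM: float) -> float:
    """µL × mM gives nmol; multiply by 1000 to obtain pmol."""
    return volume_ul * conc_mM * 1000.0


def conc_uM(amount_in_pmol: float, total_volume_ul: float) -> float:
    """pmol per µL is numerically equal to µM."""
    return amount_in_pmol / total_volume_ul


substrate_pmol = amount_pmol(volume_ul=0.2, conc_mM=5.0)   # compound 11
protein_pmol = 18.0                                        # fusion protein
total_volume_ul = 15.0

print(f"substrate: {substrate_pmol:.0f} pmol")
print(f"molar excess over protein: {substrate_pmol / protein_pmol:.0f}-fold")
print(f"substrate concentration: {conc_uM(substrate_pmol, total_volume_ul):.0f} µM")
print(f"protein concentration: {conc_uM(protein_pmol, total_volume_ul):.1f} µM")
```

Under these assumptions 11 is present in a large molar excess over the protein during the 1 h pre-incubation, so the protein rather than the substrate is the limiting component.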



b) Reactivity of O6-benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-ε-N-(6-
[fluorescein-5-carbonyl]-aminocaproyl)-lysine 60
18 pmol PGEA hAGT-GST are incubated with 0.5 nmol 60 in a total volume of 15 µL of 50 mM
HEPES pH 7.5. The reaction product is detected by Western Blotting using a goat anti-
fluorescein antibody (Rockland) and a HRP conjugated anti-goat antibody (Sigma). This
proves that compound 60 containing two different labels is an efficient substrate for hAGT.

Example 55: Immobilization of a human O6-alkylguanine-DNA alkyltransferase (hAGT)
fusion protein on a solid phase

80 µL of a solution of PGEA hAGT-GST fusion protein (200 µg/mL in HBS buffer) are incubated
with 20 µL of a 0.4 mM solution of O6-benzylguanine-PEG-amino-α-N-(biotinyl-6-aminocaproyl)-
ε-N-([digoxigenin-3-oxymethylcarbonyl]-6-aminocaproyl)-lysine 54 for 10 min at
room temperature. This solution is passed over a Neutravidin Biacore chip for 35 min at a
flow rate of 1 µL/min. The chip is then purged with buffer (HBST) for 40 min at the same flow
rate. Subsequently, a solution of α-GST antibody (1980 µg/mL) is injected to further elucidate
the previous binding of the fusion protein, followed again by washing with HBST.
Immobilization experiments are measured by surface plasmon resonance. Surface plasmon
resonance analysis is performed using a Biacore 1000 optical biosensor equipped with
research grade SA sensor chips (Biacore). EDTA-free HBS (10 mM HEPES, pH 7.4, 150
mM NaCl, 0.005% Tween 20) is freshly prepared and filtered through a 22 µm membrane.
The resulting sensorgram is shown in Figure 2.

Example 56: Dimerization of a human O6-alkylguanine-DNA alkyltransferase (hAGT)

a) Reaction of PGEG hAGT with the dimerizing substrate 43:
Dimerization experiments are performed in a total volume of 40 µL at a constant
concentration of PGEG hAGT-GST (A. Juillerat et al., Chem. Biol. 10: 313-317, 2003) of 0.4 µM
in HEPES-buffer (HEPES 50 mM, pH 7.2, 1 mM DTT, 200 µg/mL BSA). The concentration
of compound 43 is varied to cover a ratio of substrate to protein from 0 to 100:1. Incubation
time is 40 min for all experiments. Western Blots are obtained using an anti-hAGT antibody
(1:2000, in PBS, Chemicon) and an anti-mouse-HRP conjugate (1:2000 in PBS, Sigma) as a
second antibody. The result is shown in Figure 3A.

b) Reaction of HA- W160 hAGT with the dimerizing substrate 61 in E. coli extracts
Experiments are performed with the hAGT mutant Gly160Trp (W160 hAGT) (M. Xu-Welliver et
al., Biochemical Pharmacology 58: 1279-1285, 1999). HA- W160 hAGT is expressed in E. coli
BL21 in LB medium at 24°C for 2 h after addition of IPTG (1 mM) and substrate 61 in
different concentrations (0, 10, 20, and 50 µM) to an exponentially growing culture. Cells are
harvested by centrifugation. For each labelling experiment the total volume of extract is
about 1 mL (volume adjusted to a constant OD600). The pellets are resuspended in 50 µL
1xSDS followed by addition of O6-benzylguanine to a concentration of 1 mM. After heating
for 5 min to 95°C, samples are loaded onto a 12% SDS-PAGE gel. The Western Blot is obtained
using a goat anti-MGMT antibody (1:100 in TBST, Chemicon) and an anti-goat IgG (1:1000 in
TBST, Santa Cruz Biotechnology) as a second antibody. The result is shown in Fig. 3B.

Example 57: Reactivity of 78 and 79 against GST- 3HY hAGT: 

To verify that 3HY hAGT possesses activity against 78 and 79, 3HY hAGT is expressed as fusion 
protein with glutathione S-transferase (GST- 3HY hAGT) in E. coli, the fusion protein purified 
and its activity determined in an in vitro assay. GST- 3HY hAGT is incubated with an excess of 
78 or 79 and the reaction is followed by Western Blotting using an anti-methotrexate 
antibody. In this assay, GST- 3HY hAGT shows second-order rate constants of about 1500
sec⁻¹ M⁻¹ for the reaction with 78 and 900 sec⁻¹ M⁻¹ for the reaction with 79. Accordingly, at
concentrations of 78 of 1 µM and at an excess of 78 over 3HY hAGT (pseudo first-order
conditions), 50% of the protein is labelled in less than 10 min. 
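
The half-time quoted above follows from the rate law for pseudo first-order conditions: with the substrate in excess, the observed rate constant is k_obs = k2 × [substrate], and the half-time is ln 2 / k_obs. The Python sketch below works this through for the stated rate constants. It assumes the substrate concentration stays constant at 1 µM, and the function names are illustrative.

```python
import math


def half_time_min(k2_per_M_per_s: float, substrate_M: float) -> float:
    """Half-time (min) under pseudo first-order conditions: ln 2 / (k2 * [S])."""
    k_obs = k2_per_M_per_s * substrate_M          # observed rate constant, 1/s
    return math.log(2) / k_obs / 60.0


def fraction_labelled(k2_per_M_per_s: float, substrate_M: float, t_min: float) -> float:
    """Fraction of protein labelled after t_min minutes: 1 - exp(-k_obs * t)."""
    k_obs = k2_per_M_per_s * substrate_M
    return 1.0 - math.exp(-k_obs * t_min * 60.0)


t_half_78 = half_time_min(1500.0, 1e-6)
t_half_79 = half_time_min(900.0, 1e-6)
labelled_78_10min = fraction_labelled(1500.0, 1e-6, 10.0)

print(f"t1/2 with 1 µM 78: {t_half_78:.1f} min")
print(f"t1/2 with 1 µM 79: {t_half_79:.1f} min")
print(f"fraction labelled by 78 after 10 min: {labelled_78_10min:.2f}")
```

With a rate constant of about 1500 per second per molar, the half-time at 1 µM 78 comes out at roughly 7.7 min, in line with the statement that half of the protein is labelled in less than 10 min.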

Example 58: hAGT mutants 

For the construction of hAGT fusion proteins, preferably the mutant Asn157Gly Ser159Glu is 
used. This mutant shows increased activity against benzylguanine derivatives of the type 1 
by a factor of approximately 20 compared to wild-type hAGT, see A. Juillerat et al., Chem.
Biol. 10: 313-317, 2003. The following additional mutations in hAGT have been shown to
disrupt DNA binding of hAGT but do not significantly interfere with the activity against
benzylguanine derivatives: Lys125Ala, Ala127Thr and Arg128Ala, see A. Lim et al., EMBO J.
15: 4050-4060, 1996, and D.S. Daniels et al., EMBO J. 19: 1719-1730, 2000. The rationale
behind the introduction of these mutations is to minimize the interaction of hAGT fusion
proteins with DNA in the three-hybrid system. The resulting hAGT mutant carrying the five
mutations described above is abbreviated as 3HY hAGT.

Example 59: Growth assay of yeast L40 strains

Pairs of plasmids encoding LexA- 3HY hAGT and B42-DHFR fusion proteins are transformed
into the yeast strain L40 in which the reconstitution of functional LexA-B42 transcription
factor leads to transcription of the reporter genes HIS3 and lacZ. It is tested whether the
expression of different combinations of fusion proteins in the presence of either 78 or 79
complements the histidine auxotrophy of the yeast L40 (Table 1). Co-expression of
LexA- 3HY hAGT and B42-DHFR allows yeast L40 to grow on plates lacking histidine but
containing either 78 or 79, indicating transcription of HIS3. No growth is observed in the
absence of 78 or 79.

Table 1: Growth rates of different yeast L40 strains on plates lacking histidine and in the
presence and absence of compounds 78 or 79.

Protein pair                   Benzylguanine   Growth rate a)
LexA- 3HY hAGT / B42-DHFR      1 µM 78         +++
LexA- 3HY hAGT / B42-DHFR      10 µM 79        +++
LexA- 3HY hAGT / B42-DHFR      none            -
LexA-DHFR / B42- 3HY hAGT      1 µM 78         ++
LexA-DHFR / B42- 3HY hAGT      10 µM 79        ++
LexA-DHFR / B42- 3HY hAGT      none            -

a) +++: detection of colonies within 2 days; ++: detection of colonies within 3 days;
+: detection of colonies within 5 days; -: no colonies within 7 days.

In principle, the reconstitution of functional transcription factor through benzylguanines 78 and
79 is independent of which ligand-binding protein is fused to LexA and which to B42. To
verify this assumption, the growth of yeast L40 co-expressing LexA-DHFR and B42- 3HY hAGT
on plates lacking histidine but containing compounds 78 or 79 is examined. As expected,
growth dependent on compounds 78 or 79 is observed, although the growth rates are below
those observed for yeast co-expressing LexA- 3HY hAGT and B42-DHFR (Table 1).

Example 60: Quantitative β-galactosidase assay using ONPG

The activation of transcription of the lacZ reporter gene by the benzylguanine-based
compounds 78 and 79 in the yeast strain L40 expressing different combinations of LexA and
B42 fusion proteins is examined (Table 2). In this assay, the activity of the product of the
lacZ gene, β-galactosidase, is determined by measuring the hydrolysis rate of the
chromogenic substrate o-nitrophenyl-β-D-galactopyranoside (ONPG) in cell extracts of liquid
cultures.



Table 2: Quantitative β-galactosidase assay using ONPG.

Protein pair                   Benzylguanine   ONPG hydrolysis a)
LexA- 3HY hAGT / B42-DHFR      1 µM 78         865
LexA- 3HY hAGT / B42-DHFR      1 µM 79         63
LexA- 3HY hAGT / B42-DHFR      none            4
LexA-DHFR / B42- 3HY hAGT      1 µM 78         28
LexA-DHFR / B42- 3HY hAGT      1 µM 79         24
LexA-DHFR / B42- 3HY hAGT      none            3



a) nM of o-nitrophenol formed per min per mg total protein 
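
One way to read Table 2 is to express each value relative to the corresponding culture grown without compound. The Python sketch below computes these ratios from the tabulated numbers. The ratios are derived here for illustration and are not reported in the original text, and the dictionary keys are illustrative labels.

```python
# ONPG hydrolysis values from Table 2; keys are illustrative labels.
onpg = {
    "LexA-3HYhAGT / B42-DHFR": {"78": 865, "79": 63, "none": 4},
    "LexA-DHFR / B42-3HYhAGT": {"78": 28, "79": 24, "none": 3},
}


def fold_over_background(values: dict) -> dict:
    """Ratio of each compound-treated value to the no-compound value."""
    background = values["none"]
    return {cpd: v / background for cpd, v in values.items() if cpd != "none"}


fold = {pair: fold_over_background(vals) for pair, vals in onpg.items()}

for pair, ratios in fold.items():
    for cpd, ratio in ratios.items():
        print(f"{pair}, 1 µM {cpd}: {ratio:.0f}-fold over background")
```

Expressed this way, the LexA- 3HY hAGT / B42-DHFR pair with compound 78 stands out clearly from the other combinations, which matches the ordering of the raw values in the table.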



